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—INFRASTRUCTURE LOG 


_DAY 69: All we need is one specific piece of info. Gil 
almost had it, but his hand cramped. How are we supposed 
to find trusted business info when these massive volumes 
of disparate, conflicting information keep pouring in? 


_Gil just grabbed a stuffed panda. 


_DAY 71: The answer: IBM solutions for leveraging information. 
Now we can cleanse info and standardize source data fields 
for consistency and accuracy. I can create a single, 
comprehensive and accurate record of info across our source 
systems. Finally, I can provide a unified, trusted view of 
our information so everyone can make better decisions. 


_Our view of our data is now scalable. Just in time — І think 
we ran out of quarters. 


Information Management Download the Innovation and Competitive Advantage white paper: 
g IBM.COM/TAKEBACKCONTROL/ACCURATE 


Altova? MissionKit" 2008 — The secret to savings on top software tools. 


V Y AUTOVN 
missionkit" 2008 


Gear up for 
development excellence 


Take off with the Altova? MissionKit™, and uncover 
the secret to savings on top software tools. 


Spied in the Altova MissionKit 2008: 
e The world's leading XML development tools: Altova XMLSpy®, MapForce®, StyleVision® 
e Plus available options that add: Altova UModel®, DatabaseSpy™, DiffDog®, SchemaAgent®, SemanticWorks® 


The Altova MissionKit 2008 bundles up to eight Altova products for less than the price of two!* 
All MissionKits include XMLSpy, MapForce, and StyleVision at substantial savings, plus options 
are available that add up to five additional application development, data management, and 
modeling tools effectively free of charge. It's your first-class ticket to the power, speed, 
and simplicity of Altova's award-winning product line. Save a bundle! 


Download the Altova? MissionKit" 2008 today: www.altova.com 


*Pricing comparison is edition-specific and 
based on Altova MissionKit vs. any two individual 
licenses of the following included products: 


ALTOVA XMLSpy, MapForce, and StyleVision. 
e LJ 


A Fresh Approach to 


S Server has changed a lot since 1999, 
when SQL Server 7.0 launched 


and SQL Server Magazine began providing expe- 
rience-based content created by the most respect- 
ed members of the community. However, what 
hadn't changed was SQL Server Magazine's look 
and feel, which had grown dated and cluttered, 
leading us to realize that a redesign was past due. 
After examining reader surveys, Web traffic, gaps 
in Microsoft's documentation, and community 
resources, we reached three conclusions that led to 
the new approach we're introducing this month. 

First, readers still need in-depth, increasing- 
ly specialized, deeply technical content written 
by experts. Michael Otey, Kalen Delaney, Itzik 
Ben-Gan, Brian Moran, Michelle Poolet, Kevin 
Kline—these are the industry's top experts and the 
core of our team. These luminaries—and bright 
new stars—will continue to ensure that you get the 
inside information and tested solutions you won't 
find anywhere else. 

Second, coverage of database development, ad- 
ministration, and business intelligence (BI) is more 
relevant than ever. Because database professionals 
must maintain expertise in database development as 
well as administration, SOL Server Magazine will 
continue providing the best content in both areas. 
And we're kicking our BI coverage into high gear. 
A wider variety of BI content will cover both the 
SQL Server back end and the front end, including 
solutions using Microsoft Office, SharePoint, and 
Microsoft Office PerformancePoint Server 2007. 


hate to one-up my boss, Karen Forster, but 

this month's editorial about the changes in 
SQL Server Magazine fails to mention some 
important elements in the new design. | guess 
she thinks it's my job to tell you gout them. 
Oh wait.. .actually, she's right. Don't mis 


; these 


2008 с on page 7. л 
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Adding context to our technical problem-solving 
articles is a new column by BI expert Douglas Mc- 
Dowell, who pulls no punches and draws the line 
between hype and reality. 

Third, readers at all levels have an insatiable ap- 
petite for articles on the fundamentals of SQL Serv- 
er, database administration and development, and 
BI. We also see a new user base emerging as SQL 
Server becomes a requirement for using products 
ranging from Microsoft System Center Configura- 
tion Manager and System Center Operations Man- 
ager to Windows Server Update Services, Forefront 
Security Server, Microsoft Office SharePoint Server, 
and on and on. In response to this demand for fun- 
damentals content, each issue will include articles 
that let you test and expand your knowledge. 

SOL Server Magazine is dedicated to making 
your job easier. This redesign extends that mission to 
making it easier to read the magazine; interact with 
editors, writers, and fellow readers; and find related 
content on the Web. Our cleaner, more consistent 
design guides you through elements such as tables, 
figures, and code, without being overwhelming. 

Months of work went into this refresh, and I 
thank Diana May for leading the editorial efforts 
and managing the project, David Kirby for his de- 
sign leadership and excellence, and Kate Brown for 
making the production piece top-notch. 

Please tell me what you think of our new ap- 
proach and design. I promise we'll listen and re- 
spond to your feedback. Е 

InstantDoc ID 96976 


u need. (Well, some 


) Consider me your 


We've improved elements such as More on the 

Web icons, Learning Paths, and Editor's Notes to 
better provide you with supplementary information. 
Use these guides to learn more about an article or 
author and to access additional content on the Web. 


EDITORIAL 
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Karen Forster 


karen (0) windowsitpro.com) is editorial and 
strategy director for Windows IT Pro and SQL 


Server Magazine and former director of 


Windows Server User Assistance at Microsoft. 


A new 
column by 
BI expert 
Douglas 
McDowell 
pulls no 
punches 
and draws 
the line 
between 
hype and 
reality. 


i Шай 


Christan 
Humphries 


chumphries @ penton.com) is production editor 


for Windows IT Pro and SQL Server Magazine. 
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Q Editors Note 


биг readers 
are the best! 
In response to 
your great feedback, you'll 
see a new look and feel to 
SQL Server Magazine this 
month. But these changes 
aren't just skin deep. We've 
improved navigation, but 
what you'll appreciate most 
about these changes are 
the expanded BI coverage, 
more focus on SQL Server 
fundamentals needed to 
do your job, and a good 
balance of topics covering 
all the different versions 
of SQL Server currently 
deployed in your organiza- 
tions. | encourage you 
to take advantage of the 
additional resources we have 
on SQLMag.com and the 
opportunities to engage in 
your SQL community. Drop 
me an email at dmay@ 
sqlmag.com, and let me 
know how we can 
better service you today 
and tomorrow. 
—Diana May, 
technical editor 


matin 
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16 —Stacia Misner 


SSRS and MOSS integration promises better information access and 
simpler report administration, but what will you have to do to make 
it work? This first article of a two-part series shows you how to set up 
SSRS and MOSS’s information-integration environment. 


InstantDoc ID 96840 


FEATURES 


11 


Getting to Know 

Wait Stats 

—Andrew J. Kelly 

SQL Server s built-in DMV functionality offers 
all kinds of performance data. This month, we 
show you how to monitor wait stats on your 


29 


SSRS and MOSS 2007 


Cover/Cover Story Illustration: Sebastian Kaulitzki/Shutterstock 


Making the Most of 
Automatic Statistics 
Updating 

—Kalen Delaney 


Tune query performance by using UPDATE 
STATISTICS and two new trace flags in SQL 


SOL Server instance. Server 2005 
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Sharpen Your Skills: 
Routines and Clauses 


—Pinalkumar Dave 

SQL Server often features several ways to do 
the same or similar tasks. To select the best 
technique for the job, you need to understand 
the differences. 

InstantDoc ID 96745 


Grouping Sets, Part 1 
—Itzik Ben-Gan 

In this early look at SQL Server 2008, you'll 
learn about the new version's implementation 
of grouping sets, which are useful for analyz- 


InstantDoc ID 96805 
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Data Warehousing: 
Dimension Basics 
--Місһе!е A. Poolet 

As you plan and create your data warehouse, 
you need to do two things for your users: De- 
cide the level of detail you want—how finely 
grained your dimensions will be—and flatten 


hierarchies to make data more accessible. 
InstantDoc ID 96846 


Protect UDM with 
Dimension Data 
Security, Part 2 


—Teo Lachev 

Leverage SSAS stored procedures to integrate 
UDM with an external security service. 
InstantDoc 10 96763 


COLUMNS 


9 Еуе оп ВЇ: 
4 Darn Good Reasons for 
Not Building Cubes 
on Transactional Systems 
—Douglas McDowell 
Don't build that cube! OLTP—based 
cubes are usually poorly designed for 
BI and contain dirty data. Still not 


convinced? There are other compelling 
———_teasons as wel] — 0. 
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IN EVERY ISSUE 


3 Editorial: 
A Fresh Approach to 
SQL Server 
—Karen Forster 
InstantDoc ID 96976 


3 Your Savvy Assistant 


Ral] 


SQLMAG.COM Community 
Dialog: 

Too Soon for SQL Server 
2008? Readers Say “Yes!” 
—Anne Grubb 

InstantDoc ID 96876 


PRODUCTS 


40 Buyer’s Guide: 
Database Design Tools 
—Karen Bemowski 
The prerequisite for an efficient, 
high-performance database is a 
database design. Using one of these 
nine design tools will make you 
more productive and the resulting 
database more efficient. 
InstantDoc ID 96845 


43 Industry Bytes 
Diana May and Megan Bearly share 
insights from their conversations 
with Heroix and SQL Farms. 


—Derek Comingore 


WEB EXCLUSIVE 


Leveraging Managed Functions in Computed Columns 


10 Tool Time: 
SQL Nexus 
—Kevin Kline 
SQL Nexus is a powerful front-end 
visualization tool for detecting, 
analyzing, and troubleshooting 


— — — problemson SQL Server 2005. -. 
InstantDoc ID 96774 


15 Reader to Reader 
4] MarketPlace 


48 Mike’s QuickPicks: 
Connecting SQL Server to 
the IBM System i 

_____—Місһае! Otey — 
InstantDoc ID 96679 


48 Dilbert 


44 New Products 
—Jeff James 
Check out new and improved SQL 


Server—related products. 
InstantDoc ID 96788 


45 Product Review: 
SoftTree Technologies 
SQL Assistant 2.5 
—John Green 
Want to optimize your SQL coding 
sessions? SoftTree Technologies offers 
some helpful coding features with 
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Get an introduction to the concept of managed computed columns and see a simple 
example of one that implements CLR integration string-parsing functionality. 


InstantDoc ID 96239 
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Number of extra outdoor hours you gained 
this week thanks to DBArtisan. 
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So many databases, so little time. Embarcadero 


tool, all platforms. 


DBArtisan is the one powerful yet simple cross- 
platform solution for managing all your databases 


from a single console. 


See how one gets it done for Oracle, SOL Server, 
DB2, Sybase and MySOL. 


Visit www.embarcadero.com/dbartisan/ 


to download a free trial. 
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Community Dialog 
Too soon for SQL Server 2008? Readers Say "Yes!" 


[| their recent commentaries about the 
n impending release of SQL Server 2008, 
Michael Otey and Brian Moran homed in on what's 
clearly a hot button for SQL Server pros. (See “Тоо 
Soon for SQL Server 2008?" August 2007, InstantDoc 
ID 96028 and “Leapfrogging to Katmai,” June 2007, 
http://www.sqlmag.com/Article/ArticleID/96348/sql_ 
server_96348.html.) Readers responded to Michael 
‘and Brian’s columns by voicing their concerns 
about Microsoft’s 2008 “technology refresh” release 
coming on the heels of a major upgrade. Most of the 
responses fell into one of two categories: either “2008? 
We're still getting used to 2005!” or “We’ll probably 
just skip the 2005 upgrade and go straight to 2008.” 
Here’s a sampling of feedback from both online com- 
ments and email. 


“T agree with your editorial; every developer and 
architect I know is still coming to terms with the 
nuances of SQL Server 2005; 2008 is too tight of 
a turn.” 

—Austin Zellner 


“Not to sound naive, but the new major version- 
release schedule Microsoft has started is more about 
money than support. Yes, they are a company and are 
supposed to make money. But this new release is too 
soon for many of the larger customers. The changes 
to the BI area of SQL Server have many still learning 
and adjusting.... Adoption of tools takes time and 
resources to accomplish. Backward compatibility 
has always been an issue for Microsoft, [and] I have 
to wonder what problems will occur if people skip a 
release—which I can see becoming more common.” 

—Ric Williams 


“My opinion and that of other SQL Server users 
and DBAs that I’ve talked to is that we're all just 
now starting to move in the direction of SQL Server 
2005, either due to budget reasons or waiting for 
software vendors to certify their software on SQL 
Server 2005. Now we get word that in six months, 
Microsoft is coming out with the next version? We're 
wondering what the purpose of doing anything 
in SQL Server 2005 is, and we're thinking of just 
waiting for next year to try and adopt SQL Server 
2008. But that 1s only if our software vendors certify 
it for compatibility... It's a vicious cycle; it seems that 
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now that Microsoft has adopted this new release 
schedule, we're always going to be at least one, if not 
two, versions behind." 

—Greg Sopczak 


“We have only one SQL 2005 install here. The rest 
(20 production servers) are on SQL Server 2000... 
about half on Windows 2000. We are in a budget 
crunch, so we cannot come up with the $250,000 
or so to upgrade. At this point, we will probably be 
doing the calendar year 2009-10 jump from SQL 
Server 2000 to 2008." 


—Markus SQL 


GET ACTIVE ONLINE AT SQLMAG.COM! 


Check out this month's featured Community Dialog about SQL Server 


YOUR 
VOICE 


MATTERS! 


2008 at http://www.sqimag.com/Articles/Article|D/96028/96028.html and http:// 
www.sglmag.com/Article/ArticlelD/96348/sql. server. 96348.html. Be sure to log 


on, or if you're not already a registered user, click the green Register button at the 


top right of the page to sign up. 
AND FOR A DIFFERENT POINT OF VIEW... 


You wouldn't know it from readers' comments, but some people "can't wait for 
2008." Douglas McDowell is one of those who rejects the meme that it's too soon 
for a SQL Server upgrade—and he's ready to take his case for SQL Server 2008 to 
SQL Server Magazine readers. Read Douglas's defense of 2008 next month! 


“It’s too soon. We have just kicked off our upgrade 
of SQL Server 2000 to 2005, having about 30 servers. 
This will take at least a year due to dependen- 
cles between different applications and having to 
upgrade all our third-party software to the version 
[that's] supported on SQL 2005.... The benefits of 
upgrading to 2005 in our environment are minimal 
as 2000 15 very stable, so the main driver is the end 
of support for 2000.” 

—pranil 


“We had done a lot of work in preparation for 
upgrading from 2000 to 2005 and were just getting 
ready to start, when we heard the announcement 
about 2008. Now we'll be leapfrogging 2005 and 
waiting for 2008. For us the major issues are DTS 
to SQL Server Integration Services (SSIS) migra- 
tion, user testing, and upgrading the skills of our 
development staff." SOL 
—ragresti 

InstantDoc ID 96876. 

—Anne Grubb, Web lead editor 


October 2007 7 
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Create Dashboards and Scorecards with 
SQL Reporting Services and Dundas 


Use Dundas's industry-leading data visualization technology to 
instantly add dashboard and scorecard functionality to your 
reports in SQL Server Reporting Services 2005. 


Dundas Chart, Dundas Gauge, Dundas Map and Dundas Calendar for 
Reporting Services offer unique, seamlessly integrated data 
visualization functionality that can greatly improve the user 
experience in SQL Server Reporting Services 2005. These four 
products are included in the Dundas Dashboard Bundle for 
Reporting Services, giving you all you need to turn your reports into 
visually appealing dashboards and scorecards quickly and easily. 
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By visually enhancing your reports, you and your users have more Ez инин | 17] EVA 
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flexibilty and control over your data. 


Download full evaluation copies today and add advanced ‘Call Centre Digital Dashboard ТЕЛЕ 
business intelligence to your corporate reports. ере AL ^e 
— 222 | жу 
art сей г NONE EN i. R 
Char бе” ТЕ — A 
for G au | dann EN Los ru = 
hart aug | Е SOFA NA |. | LB) 
Саі 21. — MÀ SECUN i EN) 
for Reporting for Reporting лаг. 
Services = 
Microsoft 
www.d undas.com GOLD CERTIFIED Data Management Solutions 
WEE AME dE Partner 


Microsoft and SQL Server Reporting Services are registered trademarks of Microsoft 
Corporation in the United States and/or other countries. 
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4 Darn Good Reasons for 
Not Building Cubes on 


Transactional Systems 
SSAS lets you do things you shouldn't 


ou can use SQL Server 2005 Analysis Services 
(SSAS) to build OLTP-based cubes on rela- 
tional data models that aren't designed for business 
intelligence (BI). But just because you can, that 
doesn't mean you should. The sidebar “Reasons Why 

You Might Think You Should Build a Cube on a 

Transactional Data Model” lists several reasons why 

you might want to do this, but Гуе encountered four 

common problems with OLTP-based cubes that 
convince me that such cubes are a bad idea. 

1. OLTP-based cubes are usually poorly designed for 
BI. If you try to build a cube on a normalized 
schema, it’s likely to become a non-intuitive and 
overly complex mess because the cube is built on a 
data structure designed for an application, not for 
end-user reporting and analysis, and the result will 
be poor performance. Alternatively, modeling BI 
data to a star or snowflake design usually results 
in a well-designed cube structure with an intuitive 
structure of measures and their dimensions (for 
details on these design methods, see “Data Ware- 
housing: Dimension Basics,” page 32, InstantDoc 
ID 96813). 

2. OLTP-based cubes usually are full of dirty data. 
By definition, when you build a cube against 
your transactional system, you remove the extrac- 
tion, transformation, and loading (ETL) process 
that provides not only data movement but data 
cleansing and merging. Data can be bad no matter 
where it’s stored, but the ETL process includes 
business rules, consistency checks, lookups, com- 
parisons, and calculations that clean up data, merge 
it nicely with other data from multiple sources, and 
store it in a model designed for end-user reporting 
and analysis. 

3. OLTP-based cubes don't handle history. A trans- 
actional system keeps what history its application 
requires; for instance, a transactional system might 
only keep the last 90 days of data, or it might 
overwrite (aka update) a record that changes. 
Therefore, you can’t write historical reports or per- 
form trend analysis. A relational data warehouse 
is often designed with snapshot fact tables that 
take historical pictures of measures (e.g., inven- 
tory levels, account balances) over time and track 
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dimensional changes showing important trending 
of time-relative attributes. If your data source isn't 
tracking history, your cube can't reflect it. 

4. OLTP-based cubes still have to be processed. Given 
that OLTP systems are transactional, building an 
OLAP solution on top often begs for near real-time 
updates. You can use proactive caching for real- 
time updates, but it can yield unexpected results 
for dimensions, especially large ones. Processing 
causes a full read of the relational tables sup- 
porting the dimension, which can cripple an OLTP 
system. If you forgo proactive caching, traditional 
processing will still include full reads. Should your 
transactional system have quiet periods, you might 
be able to get away with proactive caching, but 
implementing a traditional ETL approach using 
incremental extraction procedures creates less of 
an impact against the OLTP system. 


Special thanks to BI experts at Solid Quality Mentors: 
Dave Fackler, Alejandro Leguizamo, Javier Loria, 


Jordi Rambla, Andreas Schindler, Craig Utley, and 


Erik Veerman. [SQL 


EYE ON BI 


t 


Douglas McDowell 


douglas @ SolidQ.com) is director of Business 


Intelligence at Solid Quality Learning, a 

SQL Server MVP, and a charter member of 
the Professional Association of SQL Server 
Regional Mentor program. He is an MCSE, an 
MCDBA, an MCT, and winner of Microsoft's 
Worldwide Business Intelligence Solution of 
the Year. 
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REASONS WHY YOU MIGHT THINK YOU SHOULD 
BUILD A CUBE ON A TRANSACTIONAL DATA MODEL 
Some of the cool new capabilities in SQL Server 2005 Analysis Services (SSAS) 
let you create OLAP cubes based on relational data models (i.e., normalized trans- 
actional or OLTP databases) that aren't designed for business intelligence (ВІ). 
SSAS supports many-to-many relationships, the ability to design cubes spanning 
many tables containing measures (even across multiple data sources), near real- 
time data access with proactive caching, and the ability to build more complex 
calculated members with robust MDX syntax. Those features help build better 
cubes on complex dimensional data models, but they open up possibilities—such 
as building OLAP cubes on OLTP databases—that aren't best practices. 

The ability to build a cube on a transactional data model can make sense in 
some scenarios. For instance, it's better to build a cube based on a transactional 
data model than to do reporting and analysis on a mission-critical OLTP system. At 
least the SSAS cube buffers the underlying system from the performance impact 
of BI, buffers the end-user from schema changes, and adds SSAS programmabil- 
ity, security, and performance. If you need to get BI moving in your organization, 
building a data source view over OLTP and adding SSAS could be a valid approach 
(e.g., as proof of concept or a Phase 1 effort). Some situations do require near 
real-time data, and building an OLTP-based cube and using proactive caching 
meets that requirement. However, such situations are rare; if needed, real time 
date can usually be married with a data warehouse-based solution. 


InstantDoc ID 96812 
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kevin.kline @ quest.com) is the director of 


technology for SQL Server Solutions at Quest 
Software and president of the international 
Professional Association for SQL Server. 

He is the author of SQL in a Nutshell, 

2nd edition (O'Reilly Media, 2004). 
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SQL Nexus 


Optimize SQL Server 2005 


performance 
Nexus is a powerful front-end visual- 


SQ ization tool for detecting, analyzing, 


and troubleshooting problems on SQL Server 2005. 
It uses SQLdiag on the back end as its main data 
collector. (For more information about SQLdiag, 
see Tool Time, “SQLdiag,” March 2007, InstantDoc 
ID 94853.) 

SQL Nexus is extensible. Not only does it let you 
run several useful built-in reports, it also lets you 
customize existing reports and write your own. The 
tool uses SQL Server Reporting Services (SSRS) 
to generate reports and charts from the diagnostic 
data it collects. SQL Nexus then uses SQL Server 
to collect and aggregate diagnostic information into 
a data warehouse; you can use this information for 
long-term diagnostics and performance analysis. 


Functionality 

After you download and install SQL Nexus, you 

can get useful information by running SQLdiag as a 

service, called diag$sqlnexus, and importing the per- 

formance data and trace files. The SQLdiag service 
is controlled through SQL Nexus's Service toolbar 
or by SQLdiag directly. 

The exact diagnostic data that SQL ар collects 
is determined by an XML configuration file that's 
passed to SQLdiag when it's registered. This file is 
stored in the Collection subfolder under the SQL 
Nexus startup folder. 

You can run SQL Nexus in real-time mode or 
postmortem mode. Real-time mode shows you data 
as it’s collected and periodically refreshes the various 
reports. Postmortem mode treats the data as static 
and doesn't automatically refresh. 

SOLNexus has four main features, each of which 
reveals a great deal about the performance of your 
SQL Server instance. 

1. CurrentServer Status: This feature provides the cur- 
rent status of your SQL Server machine. As Web 
Figure 1 (http:/Avww.sqlmag.com, InstantDoc ID 
96774) shows, this screen graphically details CPU 
utilization and memory utilization. It also pro- 
vides textual breakouts of database I/O statistics, 
system warnings, and currently active queries. 

2. Bottleneck Analysis: As Web Figures 2 and 3 
show, this option shows the major consumers of 
various system resources, as well as which system 
components are generating the most waits. 


ORE on the WEB 


See the Web figures at 
InstantDoc 10 96774. 


3. Blocking and Wait Statistics: If the Bottleneck 
Analysis reveals that blocking is the problem, 
you can use this feature to dive deeper into any 
existing blocking chains, as Web Figure 4 shows. 

4. Profiler Trace Analysis: This option tracks the 
most expensive SQL queries on your server and 
provides a graph that details CPU consumption, 
batch completion, reads, writes, and query dura- 
tion, as Web Figure 5 shows. 


SQL Nexus has built-in functionality to export 
its reports to most common formats, such as Micro- 
soft Excel, PDF, and several image types. You can 
also email SQL Nexus data as Excel spreadsheets. 
And finally, you can use SSRS or Microsoft Visual 
Studio to customize or even write all new reports for 
SQL Nexus. 


System Requirements 

SQL Nexus requires two main components. SQL 
Nexus itself is the main front-end visualization tool; 
SQLdiag is the main back-end data collection tool. 

You need to have a fairly large amount of free 
disk space available on your workstation or server, 
because some diagnostic data (e.g., a Profiler trace 
file) is collected on your local hard drive, then 
loaded onto the SQL Server machine after the fact. 
Although SQL Nexus deletes older data files as it 
loads new ones, you still need sufficient disk space 
to store a reasonable number of trace (.trc) files until 
cleanup can occur. The amount of disk space that's 
necessary varies widely, based on server load. 

SOL Nexus was the brainchild of Ken Hen- 
derson and Bart Duncan back when they were 
both with Microsoft's SQL Server Product Support 
Services (PSS). You can download SQL Nexus from 
the official Web site, at http://www.sglnexus.net. You 
can download the SQL Server 2005 Performance 
Statistics Script files, upon which the SQL Nexus 
reports are based, from Bart Duncan's February 21, 
2007, Microsoft PSS SQL Support blog entry; go 
to http://blogs.msdn.com/psssql/archive/2007/02/21/ 
sql-server-2005-performance-statistics-script.aspx 


and click the zip file at the bottom of the entry. And 


of course, we want to hear your feedback on the 
Tool Time discussion forum at http://www.sqlmag. 
com/go/tooltime. 500 

InstantDoc ID 96774 
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Getting to Know Wait Stats 


Further enhance your SQL Server 2005 
management skill with another DMV method 


ast month, in “Getting to Know Virtual File 

Stats" (InstantDoc ID 96513), I showed you 
how to use a particular dynamic management view 
(DMV) to monitor the usage of physical I/O with 
respect to SQL Server 2005. This month, I take a 
similar approach with another DMV that lets you 
monitor wait stats on your SQL Server instance. 


Wait Stat Basics 

Whenever a request is made within SQL Server 
that—for one of many reasons—can't be immediately 
satisfied, the system puts it into a wait state. The SQL 
Server engine internally tracks the time spent waiting, 
aggregates it at the instance level, and retains it in 
memory. Using a new DMV in SQL Server 2005 
called sys.dm_io_wait_stats, you can read these met- 
rics (or statistics) at any time. 

Sys.dm_io_wait_stats materializes the in-memory 
values of the wait stats as a typical result set that 
represents the total waits for all processes that have 
finished since the counters were last reset. Be aware 
that SQL Server resets these counters to 0 at each 
restart; alternatively, you can manually reset them 
by executing the command 


DBCC SQLPERF ('sys.dm_os_wait_stats', 
CLEAR); 


Keep in mind that each time you query this view, 
you'll get a snapshot from that point in time, which 
includes the total waits on the instance. Ideally, you 
want to see a differential or delta of two distinct 
snapshots so that you can better gauge the amount 
of waits during a particular time period. Table 1 
shows the output of the DMV. 

At the time of this writing, there are just over 
200 different wait types available for SQL Server to 
monitor. Microsoft will add more over time, with 
each new version and service pack. You'll prob- 
ably never have to deal with most of these wait 
types, so you don't have to worry too much about 
knowing what they all mean. If you want details 
about each wait type, check out the Microsoft article 
“SQL Server 2005 Waits and Queues” (http://www 
microsoft.com/technet/prodtechnol/sgl/bestpractice/ 


performance tuning waits queues.mspx). The article 
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includes some scripts for capturing and reporting on 
the waits in your system. In the SQL Server Magazine 
Performance Tuning and Optimization subforum 
(http://sqlforums.windowsitpro.com/web/forum/ 


categories.aspx?catid=256), ГЇЇ add my own rendition 


of how to capture and report on the wait stats in both 
SQL Server 2005 and SQL Server 2000. 


Digging into It 

Table 1% waiting tasks count column gives you an 
indication of how many times the processes have 
begun to wait, for that particular wait type. Gener- 
ally, the most important column is the wait_time_ms 
column, which tells you how many total milliseconds 
passed for each wait type. Obviously, the longer the 
wait time, the less efficient the process becomes. But 
another aspect of waits that you need to be keenly 
aware of is signal_wait_time. 

Figure 1, page 12, helps illustrate the importance 
of the signal_wait_time column. When a thread can’t 
continue because a resource has become unavailable, 
the system places the thread into a wait state of one 
type or another. This scenario can occur for a number 
of reasons, but for the purpose of this article, suppose 


TABLE |: DMV Results 


Andrew J. Kelly 
AKelly @ SolidQ.com) is a SQL Server MVP 


and the practice manager for performance 
and scalability at Solid Quality Mentors. He 
has 20 years experience with relational 
databases and application development. He 
is a regular speaker at conferences and 


user groups. 


Total number of times a wait of this type was 
Total time in milliseconds for this wait type, 


Maximum time in milliseconds for any one wait of 


Column Name Data Type Description 
wait_type nvarchar(60) Name of the wait type 
waiting tasks count bigint 

started 
wait time ms bigint 

inclusive of signal wait time 
max wait time ms bigint 

this wait type 
signal wait time bigint 


Difference between the time the waiting thread 


was signaled and when it started running 


you need to read 100 pages of data, but only the first 
few pages are in the data buffer cache. Also suppose 
that the request to fetch the other pages from the disk 
and place them into the cache where you can read 
them will take 20 milliseconds to complete, and you 
have lots of users requesting time on the processors. 
Once the thread goes into the wait state, the scheduler 
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TO = Resource requested is unavailable 
Wait state begins 


Signal wait time 


жс. 
= 
- 

== 


Figure 1 


WAIT STATS 


will pull that thread off the processor and place it in 
the worker queue so that another thread that's ready 
and able to process data can run. 

The point labeled TO in Figure 1 illustrates this 
moment, which is the start of the wait time. At this 
point, SQL Server starts keeping track of how long the 
thread has been waiting, and for which wait type. In 
this case, it will most likely be a PageIOLatch wait— 
typically associated with disk-to-memory transfers. 


T2 = Start using resource 
Wait ends 


After approximately 10 milliseconds, the I/O request 
has been satisfied and the page is in cache and ready 
to be read by your thread, as reflected by the point 
labeled T1. However, to read this data and continue 
the processing, the thread must be reloaded back 
onto the processor from the queue. If other threads 


Audit Data changes, 
Recover without a backup 


ApexSQL Log 


the Ultimate Log Reading, Auditing and 
Recovery tool for SQL Server 
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are currently running, it might be а while before the 
processor becomes available again—reflected as the 
point labeled T2. The time between T1 and T2 is 
called the signal wait time and really has nothing to 
do with the original problem that caused the wait but 
gets reflected in the overall wait time, which appears 
in the DMV’s wait time ms column. Lots of signal 
waits indicate excessive CPU pressure, whereas fewer 
signal waits place the bottleneck on the specific wait 
type resource. As you can see, this is an important 
distinction that —if not accounted for—can lead you 
down the wrong path. 


Find Bottlenecks 
If you want to know where your SQL Server instance 
spends most of its time waiting, simply use the 
built-in DMV to occasionally peek at these statistics. 
If you order the report in descending order of most 
waits to least waits, you can quickly see where your 
largest bottlenecks are and where you might want to 
investigate problems and determine how to minimize 
those waits. Keep in mind that there will always be 
waits in an active system. The key is to know when 
those waits are excessive and to be proactive in 
addressing them. Е 
InstantDoc ID 96746 


www.apexsql.com 


or phone 866-665-5500 
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Sharpen Your Skills: 
Routines and Clauses 


UDFs vs. stored procedures and 


HAVING vs. WHERE 


6 Server often provides several ways to 

accomplish a task. To choose the best 
technique for your situation, it helps to brush up 
on the differences between various options. In this 
series, I compare and contrast some similar SQL 
Server features that Im often asked about. This 
month, I compare user-defined functions (UDFs) 
with stored procedures and HAVING clauses with 
WHERE clauses. Enjoy the article and please 
send questions or topics you'd like me to cover to 
pinal@SQLAuthority.com. I look forward to your 
feedback! 


UDFs vs. Stored Procedures 
UDFs and stored procedures are both SQL Server 
objects that store one or more T-SQL statements 
in a single named, executable routine. Although 
you can often implement the same or similar 
functionality using either a UDF or a stored pro- 
cedure, the code will look significantly different 
depending on which technique you choose. Here 
are the main differences between UDFs and stored 
procedures: 

* A UDF must return a value—a single result set. 
A stored procedure can return a value—or even 
multiple result sets—but doesn't have to. 

* You can use a UDF directly in a SELECT state- 
ment as well as in ORDER BY, WHERE, and 
FROM clauses, but you can't use a stored proce- 
dure in a SELECT statement. 

• А UDF can't use a nondeterministic function such 
as СЕТБАТЕ(, NEWID(, or RANDO, whereas 
a stored procedure can use such functions. A 
nondeterministic function is one that can return a 
different result given the same input parameters. 

* AUDF can't change server environment variables; 
a stored procedure can. 

* A UDF always stops execution of T-SQL code 
when an error occurs, whereas a stored procedure 
continues to the next instruction if you've used 
proper error handling code. 


Both UDFs and stored procedures can perform well, 


depending on how you write the code. To determine 
whether a UDF or a stored procedure would yield 
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the best performance їп a particular implementa- 
tion, you should do performance testing. 


HAVING vs. WHERE 

You typically use the T-SQL HAVING clause 
along with the GROUP BY clause to search or sort «1 
based on a certain condition. But when you dont \ 
use GROUP BY, the HAVING clause acts like a 
WHERE clause to filter the results that a query 
should return. 


pinal @ sqlauthority.com) is a database 


Pinalkumar Dave 


administrator and project manager. He writes 


You can use the WHERE clause in SELECT, 
DELETE, and UPDATE statements, but you can 
use HAVING only in a SELECT statement. How- 
ever, HAVING can contain an aggregate function, 
such as COUNT(), whereas WHERE can't. 

The following two queries illustrate the WHERE 
and HAVING clauses: 


USE AdventureWorks 

GO 

--Return records that have an 
--OrderQty greater than 20. 
SELECT SalesOrderID, OrderQty 
FROM Sales.SalesOrderDetail 
WHERE OrderQty > 20 

GO 

--Return records that have an 
--OrderQty greater than 20 and a 
--total SalesOrderID greater 
--than 9. 

SELECT COUNT(SalesOrderID) 
TotalSalesOrderID, OrderQty 
FROM Sales.SalesOrderDetail 
WHERE OrderQty > 20 

GROUP BY OrderQty 

HAVING COUNT(SalesOrderID) 

> 9 

60 


regularly about SQL Server technology at 
http:/Awww.sqlauthority.com. 


The first query uses the WHERE clause to return 
all records that have an OrderQty greater than 20. 
The second query then uses the HAVING clause 
with the COUNTY) function to further filter those 
results, returning only records that also have a total 
SalesOrderID greater than 9. Ед 

InstantDoc ID 96745 
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Row-Concatenation Solutions Aren't All Equal 


L^ say you have a table with a column named 
color name and you want to store all the row 
values as a list of comma-delimited values. For exam- 
ple, if the column contains the row values of Yellow, 
Blue, and Red, the final result would look like: 


асу = {Yellow,Blue,Red} 


There are many ways to concatenate row values to 
get results like this. Although the various solutions 
produce the same result, the time it takes to get 
those results can vary significantly. With the hope 
of finding the most efficient row-concatenation 
solution, I tested two common approaches: using a 
self-reference variable within a SELECT clause and 
using FOR XML. 


LISTING 1: SampleData.sql 


IF object_id('T1','U') IS NOT NULL 
DROP TABLE T1 
CREATE TABLE Г4ро1.11711( 
Гсо111 varchar(20) NOT NULL, 
Гсо121 Гсһаг1(2000) NOT NULL DEFAULT ('a') 


»g 
DECLARE ài AS INT; 
SET ai = 1; 
WHILE 8i <= 50000 
BEGIN 


INSERT INTO dbo.T1(col1) 

VALUES(1 + ABSCCHECKSUM(NEWIDO) % 10000000); 
SET ai = 3i + 1; 

END 


LISTING 2: Solutionl.sql 


DBCC DROPCLEANBUFFERS 

DECLARE Әсу varchar (MAX) 

SET асу = !! 

SELECT асу = асу + col1 + ',' FROM T1 
SELECT LEFT(8cv, Len(a@cv)-1) 


LISTING 3: Solution2.sql 


I began by running the SampleData.sq] script in 
Listing 1 to create a table named T1 and populate 
it with 50,000 rows. (If you run this code so that 
you can test the solutions on your system, note that 
SampleData.sql takes a few minutes to run.) 

Next, I turned on the Discard results after execu- 
tion option in SQL Server Management Studio 
(SSMS) so that the time it took to generate the 
output wouldn't be taken into consideration. After 
enabling that option, I cleared the data cache and 
ran Solutionl.sql, which Listing 2 shows, to concat- 
enate rows by using a self-reference variable within 
a SELECT clause. It took 10 seconds for this code 
to run on my system. 

It's worth noting that some people prefer to use 
the following two system-defined functions to con- 
catenate rows: 


DECLARE асу varchar(MAX) 
SELECT асу = 
COALESCE(acv + ',', '') + 
ISNULL(col1,'<NULL>') FROM T1 
SELECT асу 


This code took even longer to run on my system (16 
seconds) because of the functions' overhead. 

I then tested Solution2.sql, which Listing 3 
shows, to concatenate rows by using FOR XML. 
It took less than 1 second for this code to run on 
my system, which is 10 times faster than the first 
solution. 

The first solution is not only slower but also 
leads to undefined results. This is due to the 
kind of assignment used within the SELECT 
clause. As Microsoft's Eugene Zabokritski notes, 
"The results of an assignment in the SELECT 
list...depend on what plan the optimizer chooses 
to execute the query, so technically speaking, 
the results are undefined." (For more informa- 
tion about undefined results, go to http://connect 
.microsoft.com/SQLServer/feedback/View Feed 
back.aspx?FeedbackID=126129.) 

— When you һауе a small number of row values, it 


John Alexander 
Lopez 


Download the code at 


InstantDoc ID 96784. 


probably doesn’t matter which row-concatenation so- 
lution you choose. However, when you're working with 
tables that contain a large amount of data, consider 
using FOR XML to concatenate row values. ВЕЩ 
—John Alexander Lopez, 

Database Administrator, Ubisoft Entertainment 
InstantDoc ID 96784 


Share Your Experiences 
Share your SQL Server code, comments, 
discoveries, and solutions to problems. Email 
your contributions to r2r(Q) sqlmag.com. Please 


DBCC DROPCLEANBUFFERS 

DECLARE @cv varchar (MAX) 

SET асу = (SELECT coli + ',' AS Ctext()] 
FROM T1 
FOR XML PATH('') 
ў 

SELECT LEFT(8cv, Геп(асу)-1) 
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һе SQL Server 2005 Reporting Services (SSRS) 
reporting framework satisfies the typical infor- 
mation requirements of most organizations, but 
the next generation of features available with 
SQL Server 2005 SP2 and Microsoft SharePoint 
products takes reporting to new levels. The inte- 
gration of SSRS and Microsoft Office SharePoint 
Server (MOSS) 2007 Enterprise Edition not only 
lets information workers more easily find, use, 
and share information across the enterprise, but 
it also simplifies report management and security 
implementation for application administrators. 
But what does better information access and sim- 
pler report administration really mean—and how 
much work will you have to do to make it work? 
In this article, the first in a two-part series, I 
briefly introduce you to SSRS and MOSS’s rich 
information-integration environment, explain the 
technical architecture, and walk you through set- 
ting it up. In the follow-up article, ГЇЇ show you 
how to deploy the integrated environment to sup- 
port better information access and integration, 
including how to implement security, report prop- 
erties, versioning, approval workflow, alerts, in- 
formation management policies, and the business 
intelligence support features built into MOSS. 


Improving Information Access 

SSRS integration with MOSS does more than 
enable information consolidation. MOSS's 
search capabilities let users find all relevant in- 
formation and reports at once instead of requir- 
ing them to look through a variety of reposito- 
ries. MOSS also lets you target reports to specific 
users or groups and supports sending alerts to us- 


ers when report definitions or properties change. 
In addition, MOSS collaboration features let us- 
ers supplement reports with background context 
using wikis or blogs and implement action plans 
using task lists. 

These capabilities all sound great for users, 
but how much extra effort does this mean for 
you as an administrator? Consider having to de- 
fine permissions for all types of information— 
reports, documents, spreadsheets, and more—in 
just one place. You can manage data sources in 
one place as well. You can also better manage 
report content through MOSS by requiring us- 
ers to check out reports to make modifications 
and check them back in afterward, implement- 
ing workflow to approve new reports before they 
are published, incorporating version control to 
track changes over a report’s lifetime, and en- 
forcing report retention policies. The most chal- 
lenging aspects of SSRS and MOSS integra- 
tion are component configuration and security 
implementation, which I'll cover in the second 
article of this series. 

However, before you leap to SSRS-MOSS in- 
tegration, be aware that MOSS doesn't support 
the following SSRS features: 

* Custom security extensions 

e Data-driven subscriptions 

* The rs.exe utility and scripts you use with it 
e Linked reports 

* My Reports 

* Job management features 


If you cant live without these features, you 
might consider maintaining two instances of 
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questions that need to be answered during an audit, the reasons why, 
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tial. The goal of this session is to discuss ways that you 
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This session will go through a "less 
really have to know about mo 
SQL Server 2005. Find out abo 
issues, things that will need to be 
successfully in place. 
what to expect. 


earned” look at what you 
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CONFERENCE SESSIONS 


SCHEDULE 
AT A GLANCE 


SUNDAY, NOVEMBER 4, 2007 

8:00 am-11:00 am Pre-Conference Registration ONLY 

9:00 am- 4:00 pm — Pre-conference Workshops 
MONDAY, NOVEMBER 5, 2007 

7:00 ат - 5:00pm Conference Registration 

9:00 am- 4:00 pm Рге-сопїегепсе Workshops 

6:30 pm- 8:30 pm — Microsoft Executive Keynote 

8:30 pm -10:30pm — Expo Hall/Opening Reception 

TUESDAY, NOVEMBER 6, 2007 * MICROSOFT DAY 

7:00 ат - 5:00 pm Conference Registration 

7:00 ат - 8:00 ат Continental Breakfast 

8:00 ат - 9:00 ат — Keynote 

9:30 ат -10:30 ат Conference Sessions 

10:45 am-11:45 am Conference Sessions 

11:45 am- 1:30pm — Lunch 

1:30 pm - 2:30pm Conference Sessions 

2:45 рт - 3:45pm Conference Sessions 

4:15pm - 5:15pm Conference Sessions 

7:30 рт - 9:00 рт — Microsoft Unplugged 

WEDNESDAY, NOVEMBER 7, 2007 

7:00 ат - 5:00pm Conference Registration 

1:00 ат - 8:00 am Continental Breakfast 

8:00 am- 9:15 ат ^ Conference Sessions 

10:00 am-11:15 am — Conference Sessions 

11:30 am-12:45 pm — Conference Sessions 

12:45 pm- 2:15pm — Lunch 


2:00 pm Harley-Davidson Drawing in the Expo Hall 
2:15 pm Expo Hall Closes 


2:15 pm - 3:30 рт Conference Sessions 
4:15 рт - 5:30 pm Conference Sessions 
THURSDAY, NOVEMBER 8, 2007 
7:00 am - 8:00 am Continental Breakfast 
8:00 ат 9:15am Conference Sessions 
9:30 ат - 10:45 am Conference Sessions 
11:30 am- 12:30 pm Conference Sessions 
12:30 pm- 2:00 pm — Lunch 
2:00 рт - 3:00 pm Conference Sessions 
3:00 pm- 3:30 рт Ice Cream Break 
3:30 pm- 4:15pm Closing Session 
FRIDAY, NOVEMBER 9, 2007 
9:00am- 4:00 pm — Post-conference Workshops 


SEE WEB SITE FOR THE LATEST UPDATES. 
www.DevConnections.com 
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BUSINESS INTELLEGENCE 


SBI339: AUTOMATING SSAS PARTITION 
CREATION AND PROCESSING 
SCOT REAGIN 


Partitioning Analysis Services data can signifi- 
cantly benefit both process and query perform- 
ance. However, manually creating and configur- 
ing partitions over time can be a maintenance 
headache. This session explores strategies for 
partition design and the automation of both 
creating partitions and processing intelligent 
partitions in Integration Services. 


SBI264: CONFIGURING REPORTING 
SERVICES SECURITY CORRECTLY 
STACIA MISNER 


Understanding the relationship between 


Reporting Services and ASP.NET, IIS, the database, 


and Windows security is vital to establishing the 
appropriate security policy for your environment 
and configuring the report server correctly. 

This session starts with a review of the security 
architecture of Reporting Services in native and 
SharePoint integrated mode. This session also 
includes a series of practical examples that show 
you how to set up IIS authentication, service 
accounts, and Reporting Services data sources 
for various deployment scenarios, including sin- 
gle server or multiple server environments. You'll 
see the Reporting Services security architecture. 
You'll understand the dependencies between 
Reporting Services, ASP.NET, IIS, database, and 
Windows security layers. You'll learn how to con- 
figure IIS, service accounts, and data source cre- 
dentials to support security requirements. 


SBI262: CREATING REPORTS WITH 
REPORTING SERVICES 2005 AND 
ANALYSIS SERVICES 2005 

STACIA MISNER 

With the introduction of Reporting Services 2005, 
you now have a Query Designer to generate the 
MDX required to retrieve data from a cube as a 
great first step, but you can extend this query to 
satisfy more complex reporting requirements. In 
this session, you'll learn several tips and tricks 
about working with Analysis Services 2005 as a 
source for your reports, including how to cus- 
tomize an MDX query, how to handle aggregate 
values, and how to implement cascading parame- 
terized MDX queries. You'll understand the bene- 
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fits and limitations of the MDX Query Designer. 
You'll learn how to modify an MDX query to pro- 
duce specific results. And you'll learn how to 
override default parameter behavior to create 
cascading parameters. 


SBI265: DATA MINING FOR THE REST OF US 
STACIA MISNER 

Don't worry if you don't hold a PhD in data min- 
ing or even if you didn't take statistics in college. 
You can still use and, better yet, understand data 
mining now that Data Mining Add-Ins for Office 
2007 is available. Whether you regularly analyze 
data now or provide technical support for those 
who do, it's time to learn how take business intel- 
ligence to the next level in your organization. 
This session will show you specific examples for 
exploring common data sets, such as sales and 
financial data, to find the hidden information in 
your business. You'll understand the technical 
architecture requirements for the Data Mining 
Add-Ins. You'll learn how to prepare data for data 
mining. You'll learn how to apply data mining 
techniques to specific business problems. 


SBI338: ETL AUDITING: BI FOR 
WAREHOUSE PROCESSING 

SCOT REAGIN 

Integration Services can update your warehouse 
quickly and efficiently. It can also produce audit 
metrics that you can use to better understand 
the health of your warehouse data and detect 
trends in processing before they become expen- 
sive problems. This session explores Integration 
Services package design for capturing data and 
processing audit metrics, loading these metrics 
into Analysis Services, and analyzing the results 
with Reporting Services. 


SBI353: GET A HANDLE ON SSIS DATA 
CLEANSING CAPABILITIES 

ERIK VEERMAN 

In the ever-increasing world of data and integra- 
tion, it is rare to find pure data. Most data 
sources have missing values, typographical 
errors, duplicated rows, mismatched rows, or just 
plain bad data... For those of you who are data 
integrators, DBAs, or ETL designers who have to 
work with this, you're probably pulling your hair 
out! Don't miss this session if you can relate. 
While there's no magic wand solution, SSIS will 
help you get there faster and with less stress. 
Attend this session to learn how to best apply 


the data cleansing capabilities in SSIS such as 
fuzzy logic, expressions, and data mining, to get 
a better grip on your bad data. 


5В1337: INTEGRATING OFFICE 2007 WITH 
MICROSOFT BI SOLUTIONS 

SCOT REAGIN 

With Microsoft's Office 2007 you can deliver the 
right information to all users in your organiza- 
tion quickly, efficiently, and in a format they 
can understand and use. This session discusses 
how to design and deploy Office 2007 as part 

of an integrated Business Intelligence solution 
that provides users with rich reporting, analysis, 
visualization, and performance management 
capabilities. 


5В1449: SSIS PIPELINE OPTIMIZATION 
TECHNIQUES 

ERIK VEERMAN 

This session dives deep into the underlying 
structure of the SSIS data flow pipeline in order 
to understand how the SSIS engine is handling 
your data and how to monitor and optimize your 
data flow design. This review includes a drill- 
down into the data buffer architecture, execution 
trees, engine threads, column lineage, synchro- 
nous and asynchronous transformation outputs, 
data flow buffer tuning, pipeline monitoring, 
pipeline optimization, error identification, and 


troubleshooting. Understanding the pipeline 
internals will lead to better package designs with 
more efficient and scalable data flow architec- 
tures for your SSIS environment. 


SBI252: UPGRADING YOUR DTS-BASED 
SOLUTION TO SSIS 

ERIK VEERMAN 

While the DTS upgrade wizard to Integration 
Services (SSIS) may be a click of the button, mov- 
ing from a DTS-based architecture in SQL Server 
2000 to an SSIS design in SQL Server 2005 requires 
planning and consideration. This session will focus 
on upgrading DTS ETL design to take advantage of 
the enterprise functionality that SSIS provides. 
You'll learn how typical ETL processes were 
designed in DTS and how to map those processes 
to the more architecturally friendly SSIS. 


DEVELOPER 


SDV386: AN OVERVIEW OF T-SQL 
ENHANCEMENTS IN SQL SERVER 2008 
ITZIK BEN-GAN 

This session will give you an early overview with 
code samples and demonstrations covering some 
of the interesting new T-SQL features in SQL 
Server 2008. Among the new T-SQL features that 
this session will cover are: the MERGE statement, 
Grouping Sets, Table-Valued Parameters, Date, 
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Time, and timezone aware datatypes, Row 
Constructors, and more. 


SDV354: BEST PRACTICES IN 
DEVELOPING SQL-CLR OBJECTS 

GERT DRAPERS 

Are you afraid to use SQL-CLR objects? This session 
will introduce you to best practices and do's and 
don'ts of using SQL-CLR objects. Stop being afraid. 
Being educated about how SQL-CLR objects behave 
inside SQL Server is your best line of defense. 


SDV384: BRIDGING THE GAP BETWEEN 
CURSORS AND SETS 

ITZIK BEN-GAN 

Many passionate debates revolve around whether 
cursors have a place in the database world. 
Cursors are often misused when the problem calls 
for a set-based querying solution. Misuse of cur- 
sors is often a result of lack of knowledge and 
experience of the relational model and set-based 
querying. Typically, set-based solutions are sim- 
pler and more efficient than cursor-based solu- 
tions; however, there are types of problems for 
which cursor-based solutions outperform set- 
based ones. In this session you will learn why, in 
the majority of the cases, set-based solutions are 
the way to go, and what are the types of problems 
where cursors have better performance potential. 
This session will also introduce new language ele- 
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ments introduced in SQL Server 2005 that help in 
bridging the gap between sets and cursors. 


SDV383: DATETIME RELATED PROBLEMS 
AND TEMPORAL QUERIES 

ITZIK BEN-GAN 

Datetime manipulation is in the heart of every 
database system. The datetime datatype intro- 
duces many challenges due to different language 
and cultural conventions and due to the fact that 
to date there's no separation between date and 
time. This session will talk about the challenges 
involved with datetime manipulation and tech- 
niques to address those. The session will also dis- 
cuss querying problems related to temporal data 
including overlaps, grouping by week, and so on. 


SDV371: EXTENDING THE DATA 

TYPE SYSTEM 

DON KIELY 

Running .NET code within SQL Server has opened 
up a lot of amazing possibilities, providing flexible 
options for working with relational data. 

But defining your own user-defined types is easily 
the most complex and powerful thing you can do 
with SQL-CLR code. This session will explore how 
to create custom types that perform well, write 
attributes correctly, and use the right serializa- 
tion formats. We'll also talk about issues you 
should consider when designing your own types. 


SDV355: POWER TO THE COMMAND LINE 
GERT DRAPERS 

Manage your SQL Server installations from your 
command line using Microsoft PowerShell. Learn 
how to leverage the new PowerShell script shell 
in combination with the SQL Server PowerShell 
provider and CmdLets to manage your SQL 
Server environments from within a flexible and 
powerful scripting environment. If you have a 
need to automate your SQL Server management 
tasks, PowerShell is your new friend! 


SDV276: SQL SERVER EXPRESS AND 
COMPACT EDITION: WHEN, WHY, AND 
SYNCHRONIZING 

WILLIAM R. VAUGHN 

As Microsoft fine-tunes its DBMS strategies, 
another old player has been reconfigured to pro- 
vide a fast, light, yet highly reliable platform- 
SQL Server Compact Edition. This DBMS engine is 
designed to replace or supplement MSDE and SQL 
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Server in architectures that don't need the over- 
head of a service-based DBMS engine. It's also 
designed to replace JET where customers insist 
on security and reliability (imagine that). This 
session helps developers understand where SQL 
Server Express Edition and SQL Server 2005 
Compact Edition fit together, especially given 
that the new ADO.NET 3.0 Synchronization 
Services classes can synchronize your SQL 
Server and SQLCe databases quickly and easily. If 
you're a developer, architect, or IT manager try- 
ing to determine which DBMS engine you should 
use where and why, this is your session. 


SDV367: WHAT'S SO COMMON ABOUT 
COMMON TABLE EXPRESSIONS? 

DON KIELY 

In databases of the last millennium, recursive 
queries were a royal pain to implement, often 
requiring temporary tables, cursors, logic to con- 
trol the flow, and a head full of hair to pull out 
while writing them. Yet recursive queries are 

the best way to go when you need to fit relation- 
al data into a hierarchical structure, such as 
with organizational charts or object/subobject 
scenarios. Common Table Expressions (CTE) in 
SQL Server 2005 greatly simplify writing recur- 
sive queries, making it much easier to use this 
valuable technique in your code. This session 
explores how CTEs work, how to write them and 
use them in sophisticated recursive queries, 
what they are good for, and how to keep your full 
head of hair (if you arrive with one). 
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SDB351: FOLLOW THE RABBIT- 
INTERACTIVE Q&A ON AVAILABILITY 
KIMBERLY L. TRIPP 

PAUL RANDAL 

In this session, Kimberly Tripp and Paul Randal 
will have only 5-10 slides. The focus of this ses- 
sion is on mixing availability technologies to cre- 
ate the best overall architecture to minimize 
downtime and data loss. In general, we're going 
to focus on best practices and then open up to 
your questions so that you can drive the discus- 
sion! This session might not seem as structured 
as other sessions, but you'll be surprised at how 
informative and fun it is! 
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SDB349: FOLLOW THE RABBIT- 
INTERACTIVE Q&A ON THE STORAGE 
ENGINE AND THE RELATIONAL ENGINE 
KIMBERLY L. TRIPP 

PAUL RANDAL 

In this session, Kimberly Tripp and Paul Randal 
will have only 5-10 slides. Each slide covers topics 
for discussion as well as the reason(s) for why 
something might be behaving badly and/or 
things to try to solve your problems. In general 
we're going to focus on best practices and then 
open up to your questions so that you can drive 
the discussion! Paul will focus on the SE (Storage 
Engine) and internals and Kimberly will focus on 
the RE (Relational Engine) and query tuning/per- 
formance. This session might not seem as struc- 
tured as other sessions, but you'll be surprised 
at how informative and fun it is! 


SDB352: MEMORY MANAGEMENT- 
DO YOU HAVE ENOUGH AND ARE 
YOU USING IT EFFECTIVELY? 
GERT DRAPERS 


Can you ever have enough memory? To answer 
this question you first have to learn how SQL 
Server uses your memory and how your process 
platform (x32, x64, or ia64) influences the use of 
memory. After this session you will be able to 
determine if you have enough, what is using the 
memory inside your system, and how to trou- 
bleshoot memory bottlenecks. 


SDB37T: SERVER-SIDE TRACE QUEUES: 
AUTOMATING, CONTROLLING, AND 
SCRIPTING TO MINIMIZE THE IMPACT 
OF PROFILING 

ANDREW KELLY 


Do you put off performance monitoring tasks 
because it is time consuming and cumbersome 
to set up or deal with? Would you like to auto- 
mate this whole process? Do you know how to 
script a trace, start perfmon, or load a trace file 
programmatically? If you have any doubts in this 
area you need to attend this session. Stop wast- 
ing time and be more productive. 


SDB376: SQL SERVER AGENT: FEATURES 
IN SECURITY AND AUTOMATION 
ANDREW KELLY 

SQL Server Agent is used by most DBAs but not to 
its truest potential. SQL Server 2005 brought 
some great new additions to the SQL Server 


Agent that makes it even more useful and secure 
than ever. This session will explore the roles asso- 
ciated with SQL Agent along with Proxy accounts 
and Credentials required by some Job steps. This 
session will also cover all of the aspects that you 
are probably under-utilizing now including 
Tokens, Alerts, Schedules, and more. 


SDB352: SQL SERVER DATABASE 
STRATEGIES: PHYSICAL DATABASE 
DESIGN FOR PERFORMANCE AND 
AVAILABILITY 

GERT DRAPERS 


Do you know how to get the best out of your disk 
subsystem? File creation and recreation are cru- 
cial when you need to recover your database; file 
and object placement also have an influence on 
the performance of your overall system. This ses- 
sion will introduce you to the main aspects you 
need to know about the SQL Server 1/0 architec- 
ture in order to optimize your file usage of per- 
formance and increase your availability. 


SDB347: SQL SERVER INDEXING FOR 
PERFORMANCE-FINDING THE RIGHT 
BALANCE 

KIMBERLY L. TRIPP 


In terms of performance tuning, there are few sil- 


ver bullets. If | had to choose ONE area that 
improves performance the most (when designed 
appropriately!), it's indexing. However, indexing 
strategies depend on the data and even more so, 
the usage of the data. Come to this session to 
see what indexing strategies help the base table 
the most as well as how to optimize your worst- 
performing queries. 


SDB348: SQL SERVER INDEXING 
STRATEGIES-ARE YOU SURE? 
KIMBERLY L. TRIPP 


Knowing tips and tricks to indexing is extremely 
helpful and will help you to solve "known" prob- 
lems. But what's lurking in the unknown? Is SQL 
Server using your indexes? Or, do you have a 
bunch of useless indexes? Finally, SQL Server 2005 
has an answer! SQL Server 2005 DMVs (Dynamic 
Management Views) can provide you with valuable 
information about your current indexing strate- 
gies, what should be removed, and even what's 
missing. Do you know how to find this information, 
leverage it, and then programmatically respond to 
it? Come to this session to figure it out! 


SDB350: SQL SERVER TABLE STRATEGIES 
-DESIGNING FOR PERFORMANCE AND 
AVAILABILITY 

KIMBERLY L. TRIPP 

PAUL S. RANDAL 

Often tables are designed based solely on the 
data that needs to be tracked (here's a column 
name, here's a data type-done!). Unfortunately, 
design does not usually take into account how 
the data is going to be used OR how SQL Server 
uses the data. Knowing the internals of table 
structures as well as the optmizations that come 
with good design will make your database truly 
scalable. Come to this session to learn some 
internals as well as various design strategies 
such as vertical and horizonal paritioning. 
Additionally, are there any other features that 
require changes in your design and thinking? For 
example, online index operations impact design 
because of the limitations that exist with parti- 
tioning and LOB columns. If you want to scale, 
you need to be here! 


SDB479: STORED PROCEDURE 
PERFORMANCE: TROUBLESHOOTING 
COMPILES AND RECOMPILES 

ANDREW KELLY 

Lack of plan reuse is one of the biggest perform- 
ance killers in SQL Server. Come see how to 
determine if this is an issue for you and more 
importantly, how to address it. See what deter- 
mines if plans get reused or not and why. You'll 
learn how to issue calls to the database that 
guarantee reuse and peak performance. This 
session will also cover why plans recompile 
and how to minimize or eliminate this potential 
performance hit. 
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MONDAY, NOVEMBER 5, 2007 


HMSKEY: DRIVING THE BUSINESS WITH SHAREPOINT 


TOM RIZZO, MICROSOFT 


HMS201: MICROSOFT OFFICE SHAREPOINT SERVER 2007 OVERVIEW 


TOM RIZZO, MICROSOFT 


HMS202: MICROSOFT WINDOWS SHAREPOINT SERVICES 


3.0 OVERVIEW 
LAWRENCE LIU, MICROSOFT 


HMS203: SHAREPOINT GOVERNANCE AND INFORMATION 


ARCHITECTURE GUIDANCE 
JOEL OLESON, MICROSOFT 


HMS204: MICROSOFT SHAREPOINT PRODUCTS AND 
TECHNOLOGIES 2007: ADMINISTRATIVE ARCHITECTURE 


AND PLANNING FOR DEPLOYMENT, PART 1 
JOEL OLESON, MICROSOFT 


HMS305: MICROSOFT SHAREPOINT PRODUCTS AND 


HMS206: DESIGNING AND BUILDING SOPHISTICATED 


COMPOSITE APPLICATIONS WITH MICROSOFT OFFICE 


SHAREPOINT DESIGNER 2007 


JEROME THIEBAUD, MICROSOFT 


HMS307: CAPACITY AND PERFORMANCE PLANNING FOR 


MICROSOFT SHAREPOINT PRODUCTS AND TECHNOLOGIES 2007 
JAMES PETROSKY, MICROSOFT 


HMS308: MICROSOFT ASP.NET AJAX 1.0 AND SHAREPOINT 


MIKE AMMERLAAN, MICROSOFT 
HMS309: HIGH AVAILABILITY AND DISASTER RECOVERY FOR 


MICROSOFT SHAREPOINT PRODUCTS AND TECHNOLOGIES 2007 
JAMES PETROSKY, MICROSOFT 


HMS310: SEARCH IN MICROSOFT OFFICE SHAREPOINT SERVER 
2007: CUSTOMIZING AND EXTENDING 


TECHNOLOGIES 2007: DEPLOYMENT AND ADVANCED 


ADMINISTRATION TOPICS, PART 2 
JOEL OLESON, MICROSOFT 


TOM RIZZO, MICROSOFT 


SESSIONS AND SPEAKERS ARE SUBJECT TO CHANGE. 


CONFERENCE SESSIONS * TUESDAY AND WEDNESDAY, NOVEMBER 6-7, 2007 


HDV301: BUILDING AND INCORPORATING 
CUSTOM APPLICATIONS IN SHAREPOINT 
V3 SITES 

ANDREW CONNELL 


HDV302: BUILDING AN INTERNET SITE 
WITH WEB CONTENT MANAGEMENT 

IN MICROSOFT OFFICE SHAREPOINT 
SERVER 2007 

ANDREW CONNELL 


HDV303: PRESCRIPTIVE GUIDANCE FOR 
DEVELOPERS BUILDING PUBLISHING SITES 
WITH WCM IN MOSS 2007 

ANDREW CONNELL 


HAD301: END-TO-END SOLUTIONS WITH 
THE 2007 RELEASE: DEVELOPING FOR 
IT PROS 

DAVID GERHARDT 


DV307: DEVELOPING END-TO-END FORMS 
SOLUTIONS FOR SHAREPOINT, PART 1 
DAVID GERHARDT 


HDV308: DEVELOPING END-TO-END FORMS 
SOLUTIONS FOR SHAREPOINT, PART 2 
DAVID GERHARDT 


SHAREPOINT CUSTOM WEB PART 


DEVELOPMENT-FUNDAMENTALS 
EMER MCKENNA 


SEE WEB SITE FOR SESSION 
ABSTRACTS AND SPEAKER BIOS 
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WHAT'S NEW WITH EVENTS IN WINDOWS 
SHAREPOINT SERVICES 3.0? 
EMER MCKENNA 


BRANDING YOUR MICROSOFT OFFICE 
SHAREPOINT SERVER 2007 SITES USING 


SITE DEFINITIONS AND FEATURES 
EMER MCKENNA 


HAD202: NO-CODE WORKFLOWS IN WSS V3 
DUSTIN MILLER 


HDV210: SHAREPOINT DESIGNER: 
IT'S FOR DEVELOPERS, TOO! 
DUSTIN MILLER 


HDV311: DATA VIEW / DATA FORM 
WEB PARTS: DIVE DEEP! 
DUSTIN MILLER 


HAD204: EXPLORING THE NEW MICROSOFT 
FOREFRONT SECURITY FOR SHAREPOINT 
MICHAEL NOEL 


HAD205: MONITORING A SHAREPOINT 
FARM USING SYSTEM CENTER 
OPERATIONS MANAGER 2007 
MICHAEL NOEL 


HAD305: VIRTUALIZING SHAREPOINT 
2007 ARCHITECTURE 
MICHAEL NOEL 


HDV212: CUSTOMIZING THE SHAREPOINT 
MOBILE EXPERIENCE 
NEIL IVERSEN 


HDV313: ADVANCED FEATURE 
DEVELOPMENT 
NEIL IVERSEN 


HDV314: POWERSHELL FOR 
SHAREPOINT DEVELOPERS 
NEIL IVERSEN 


HAD306: SHAREPOINT DIARIES 
RICHARD TAYLOR 


HAD307: CUSTOMIZED SITE TEMPLATE 
AND DEFINITION MIGRATION 
RICHARD TAYLOR 


HAD308: FORMS AUTHENTICATION-HOW 
TO GET YOUR INTERNET-FACING MOSS SITE 


UP AND RUNNING 
RICHARD TAYLOR 


HDV315: GETTING THE MOST OUT OF THE 
BUSINESS DATA CATALOG 
TODD S. BAGINSKI 


HDV316: IMPLEMENTING THE MOSS SSO 
SERVICE IN REAL-WORLD SITUATIONS 
TODD S. BAGINSKI 


HDV309: SHAREPOINT OBJECT MODEL / 
WEB SERVICES KICK START 
TODD S. BAGINSKI 


BONUS EVENTS 


Attend SQL Server Connections and attend the sessions of the concurrently run sessions for FREE! 


MICROSOFT 
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CONNECTIONS) 


Accessing Data Services in the Cloud 
Building Rich Interactive Applications with Silverlight 11 


eveloping Data-Driven Applications Using the New Dynamic Data 
Controls in ASP.NET 


integrating Search into Your ASP.NET Applications 

integrating Video into Your ASP.NET Application with Silverlight 
uby and the DLR in Silverlight 

Using Visual Studio 2008 to Build ASP.NET Applications 

Building a LINQ-Based Business Layer for ASP.NET Applications 
Building Data-driven Web Pages in ASP.NET 2.0 

ata Binding in ASP.NET with LINQ 

ASP.NET Deep Dive 

ASP.NET Tips & Tricks 

CSS 101-Moving Away from Table-based Layout 

Dealing with Long-running Requests in ASP.NET 

Is Your Web Site Legal? Accessibility for Visitors with Disabilities 


Building ASP.NET 2.0 Applications with Support for Personalization 
nd Membership 


à 
ASP.NET and WCF: Meet Your New Web Service 

Architectural Considerations for ASP.NET Applications 
Creating N-Layer ASP.NET 2.0 Applications 
D 
B 
P 
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esign Patterns Make Sense in ASP.NET? 

ilding High-performance ASP.NET Applications 
ogramming SQL Server 2005 Reporting Services 
ilding Dashboards: Web Parts or AJAX? 

uilding Portal Applications with ASP.NET 2.0 Web Parts 
ASP.NET Security from A to Z 
Writing Custom ASP.NET Membership Providers 
chitecting ASP.NET AJAX Applications 
ASP.NET AJAX Control Toolkit Overview 

uilding Custom ASP.NET AJAX Controls 


Enhancing an Existing ASP.NET 2.0 Site Using the Microsoft 
AJAX Framework 


The New Frontier of the Web User Interface: Silverlight 
and ASP.NET AJAX 


User Interface Options for Web Development 
Data Control Tips and Tricks 
Advanced Site Navigation in ASP.NET 2.0 


Test-driven Development and Continuous Integration 
for ASP.NET and VSTS 


œ 


= 


œ 


Dive into the 
new releases 
with Microsoft 
architects and 
industry experts! 
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Create Software That Your Users Will Instantly Know and Love 
Enhance User Productivity and Affinity With Mobile Solutions 


Native Code; From Creating New Software to Maintaining 
Existing Investments 


Next Generation Web Experiences 


Realize More Time For Adding Value By Utilizing 
the Latest Application Frameworks and Platforms 


Revolutionary User Experiences 

Safe, Secure and Reliable Applications Create Satisfied Users 
Ship High Quality Software On Time and On Budget 

Software Development in 2008 and Beyond 

Visualize Your Data 


Optimizing Online, Enabling Offline with SQL Server Compact Edition 
and Sync Services for ADO.NET 


ADO.NET Entity Framework Designers 
ntity Framework 


ata Modeling and Application Development with the ADO.NET 
Entity Framework 


What's New in Visual Basic 9.0? 

troduction to Cit 3.0 

ntroducing the Office 2007 Open XML File Formats 
ilding Data Visualization Applications with WPF 
esigning Polished Interfaces with Expression Blend 
Fundamentals of Windows Presentation Foundation 
Make WPF Databinding Work Against Business Objects 
WPF in Windows Forms and Vice Versa 
A 
B 


со 


vanced WF 

uild Workflow Activities using Objects 

esigning and Building Workflow Solutions for SharePoint 2007 
Fundamentals of Windows Workflow Foundation 
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PRE-CONFERENCE 


NOVEMBER 4 


PRE-CONFERENCE WORKSHOP > 9AM-4PM • SQL SERVER WORKSHOP 


SPR301: SQL SERVER DATABASE MAINTENANCE: 

FROM PLANNING TO PRACTICE TO POST-MORTEM 

KIMBERLY L. TRIPP AND PAUL S. RANDAL 

No matter how much effort you spend on the design of your database, if you 
don't maintain it in production then it will suffer from performance and man- 
ageability problems. The key to continued performance and smooth opera- 
tions is a well thought-out and automated database maintenance plan. This 
full-day workshop has three sections: planning, practice, and post-mortem. 
Planning for database maintenance actually starts with database design, so 
one of the things covered will be how to avoid design choices that limit data- 
base maintenance or contribute to maintenance problems. The session will 
discuss a laundry-list of maintenance problems, and then explore how to tell 
if you need to mitigate them, strategies and best-practices for doing so, and 
how to avoid having your mitigation choices cause unforeseen and undesir- 
able side-effects. Topics covered will include database files (shrink, grow, vir- 
tual log files, log size/management), consistency checks and corruption 
detection, fragmentation, statistics, backup/restore (options, granularity, 
strategies) and recovery models. The workshop will vary between 200-400 
level covering ALL the key concepts of maintenance features. Finally, the ses- 
sion will spotlight some real-world examples where people made good and 
bad choices and discuss how you can repeat or avoid them, respectively. If 
you're wondering how to bring your database back under control, and keep it 
there, then this full-day workshop will help you tame maintenance problems 
whether you're a full-time system administrator or a reluctant DBA. 


NOVEMBER 5 


PRE-CONFERENCE WORKSHOP + 9AM-4PM • SQL SERVER WORKSHOP 
SPR302: CONTROL YOUR SQL SERVER DEVELOPMENT 

GERT DRAPERS 

This workshop will jump start you into using “Visual Studio for Database 
Professionals.” Having control over your SQL Server development environ- 
ment is the first step to better quality, easier deployment, and reducing the 
complexity and risk involved in changing and deploying the SQL Server data 
tier changes which are needed by your applications. Visual Studio Team 
Edition for Database Professionals can help you apply structure to your 
schema development, organization, versioning and deployment. Learn how 
you can manage your database schema; enable team development; 
integrate the data tier into the overall development lifecycle, and stimulate 
the collaboration between disciplines. Get answers to questions like: How 
does SQL Server server-side development fit in to the overall development 
lifecycle? How do you use version control with your SQL Server schema 
objects? How do you incrementally deploy schema changes? How do you 
create a test environment? 


PRE-CONFERENCE WORKSHOP • 9AM-4PM + SQL SERVER WORKSHOP 
SPR303: SQL SERVER DISASTER RECOVERY: FROM PLANNING 

TO PRACTICE TO POST-MORTEM 

KIMBERLY L. TRIPP AND PAUL S. RANDAL 

Every DBAs nightmare is having down time and data loss and not knowing how 
to recover. However, designing and implementing a successful disaster recovery 
strategy is easier said than done. It's about asking all the right questions and 
figuring out all the best answers for your situation. This fullday workshop has 
three sections: planning, practice and post-mortem. Planning is a critical part of 
disaster recovery, but the most-often disregarded. Topics we'll cover here 


include: How do you choose technologies to fit requirements and effectively use 
key features of SQL Server 2005? How do technology choice affect workload 
performance? Putting a well-thought out plan into practice requires even more 
planning and in this section we'll discuss technology implementation, building 
step-by-step recovery/operation guides for when disasters happen, and, most 
importantly, testing your plan by simulating real problems. In the final section, 
we'll spotlight some realworld examples where people made costly mistakes 
and show you how they could have been avoided with a little planning and 
practice. If you've ever had nightmares about disaster recovery (or actually 
had a disaster!) and been at a loss for what to do, then this full-day work- 
shop will give you the direction and technical details you need for success! 


PRE-CONFERENCE WORKSHOP > 9AM-4PM > VISUAL STUDIO WORKSHOP 
VPR302: INTRODUCTION TO WINDOWS WORKFLOW FOUNDATION 
KEN GETZ 


Windows Workflow Foundation (WF) provides a standard, extensible, easy-to-use 
framework for building applications that can be represented in a flow diagram. 
Because that's pretty much any business application, you can think of WF as a 
way to build applications in which the model is the application. By tying the dia- 
gram you create in Visual Studio directly to the code you write to glue the 
pieces together, WF provides a new paradigm for building applications. In this 
introduction to Windows Workflow Foundation, you'll learn the basics you need 
to understand the more advanced topics you'll attend in the conference. In this 
workshop you'll become comfortable with WF, including handling conditions and 
exceptions, and using flow activities. You'll learn about the differences between 
sequential and state machine workflows, and you'll learn how to make deci- 
sions in workflows. Finally, you'll learn how to interact with workflows from the 
host application. If you need to learn WF, or you're just interested in seeing 
what all the talk is about, you should attend this workshop. 


PRE-CONFERENCE WORKSHOP > 9AM-4PM • VISUAL STUDIO WORKSHOP 
VPR101: WPF-A SCENARIO-BASED APPROACH 

BILLY HOLLIS 

Windows Presentation Foundation is not just Microsoft's newest UI technolo- 
бу. It's also the most widely useful UI technology ever to enter the industry. 
In fact, a major barrier to using Windows Presentation Foundation is a solid 
understanding of the various scenarios for which WPF is a good choice. This 
session covers several of the most typical scenarios for WPF, including a wide 
range from a supplement to typical Web-based systems through a line-of- 
business application with more interactive UI, and up to an advanced 
media/graphics system using advanced 3-D capabilities. I'll start by dis- 
cussing basic WPF concepts, with the assumption that the audience has not 
been exposed to WPF. The session continues by covering the major designers 
for WPF, including the Visual Studio designer and the Blend designer. Then, 
for each application scenario, I'll cover the major technologies in WPF that 
make it a good choice, and then show and analyze a sample application that 
meets the scenario. 


PRE-CONFERENCE WORKSHOP > 9AM-4PM + ARCHITECTURE WORKSHOP 
VPR301: IMPROVE YOUR SOA: DESIGNING A SECURE, RELIABLE, 
AND SCALABLE SYSTEM 

MICHELE LEROUX BUSTAMANTE 

During the architecture design phase, once you have completed a high-level 


view of the system, you are in a position to make important choices that 
impact the overall security, performance, and distribution of system function- 
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ality. This workshop will show you how to implement architectural patterns 
common to WCF deployments that address large message sizes, asynchro- 
nous functionality, authentication and authorization at every tier, transfer 
security, distributed transactions, and exception handling. In addition, you'll 
learn the implications of session on load distribution and fail-over, and learn 
how to build a router or ESB to monitor or distribute load. 


PRE-CONFERENCE WORKSHOP > 9AM-4PM • ASP.NET WORKSHOP 
APR201: ASP.NET AJAX IN A DAY-BRING YOUR OWN LAPTOP 
PAUL LITWIN & STEPHEN WALTHER 


In this hands-on workshop, you'll learn how to take advantage of Microsoft 
ASP.NET AJAX to build Web sites that provide a richer user experience. You'll 
learn how to use the new UpdatePanel control to refresh page content with- 
out performing a post back to the server. You'll also learn how to take advan- 
tage of the rich set of controls included in the ASP.NET AJAX Control Toolkit. 
These controls enable you to create a better client-side experience by dis- 
playing modal dialog boxes, auto-complete combo boxes, cascading drop- 
down lists, and client-side tabs. Finally, you'll learn how to take an existing 
ASP.NET server-side application and improve it (AJAXify it!) by adding 
Microsoft AJAX functionality. Note: You must come to this workshop with a laptop (with 
power cord and CD-ROM drive) that is configured with Visual Studio 2005/Visual Web 
Developer 2005, SQL Server 2005/SQL Server Express 2005 PLUS ASPNET AJAX, and the 
ASPNET AJAX Control Toolkit. There is not time to debug installation issues so please have the 
above up and running. 


PRE-CONFERENCE WORKSHOP > 9AM-I2PM > ASP.NET WORKSHOP 
APR301: INTRODUCTION TO CUSTOM WEB CONTROL DEVELOPMENT 
MIGUEL CASTRO 


Development of custom Web server controls has been pretty much dominated 
by tool vendors for quite some time. Companies like telerik, Developer 
Express, and Infragistics have done a great job of easing the pain that may 
come with ASP.NET development, as well as allowed us to maintain a consis- 
tency between Windows and Web development. But the truth is that the drag- 
and-drop you get with Web controls is where the consistency and similarity 
ends. What is happening behind the scenes of a Windows control and a Web 
control is very different, as is how the host application deals with them. 
Developing them is also quite different and l'm going to show you that in this 
session. l'Il start with a tour of the guts of simple Web controls, and then dive 
into developing a couple of composite controls. I'll cover control rendering, 
child controls, properties, events, styling, and time permitting, touch briefly 
on a couple of advanced technologies like designers and smart tags. 


PRE-CONFERENCE WORKSHOP * 1PM-4PM > ASP.NET WORKSHOP 

APR302: LEARN HOW TO CREATE RICH SILVERLIGHT APPLICATIONS 
DAN WAHLIN 

Silverlight, formerly codenamed WPF/E, provides a new declarative way to 
enhance the end-user experience by adding rich functionality into applica- 
tions. In this session you will learn how to write XAML code that can handle 
performing animations, displaying shapes and text, movies, and images. You'll 
also learn how you can use JavaScript to interact with Silverlight applications 
and dynamically change XAML content. 


WORKSHOPS AND SPEAKERS ARE SUBJECT TO CHANGE. 
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PRE-CONFERENCE WORKSHOP • 9AM-4PM + MOBILE WORKSHOP 

MPR301: BUILDING MOBILE SMART CLIENTS FOR THE ENTERPRISE 
NICKOLAS LANDRY 

Did you know that today's Pocket PCs are more powerful than a 1998 desktop? 
Did you know that the .NET Compact Framework offers over 30% of the full 
framework functionality in about 1096 of the size? Or that you can call standard 
Web services using a Windows Mobile device and a $25/month wireless Internet 
connection? Did you know Tablet Ink support is built-in the platform? Mobile 
devices, applications, pen computing and wireless networks have come a long 
way to become key components of the agile enterprise. There is also a wide 
range of mobile devices and computers you can target, from Smartphones and 
Pocket PCs to Ultra Mobile PCs and Tablet/Mobile PCs. This workshop explores 
mobile enterprise application development for the .NET developer like you. 
Come learn how to leverage your .NET knowledge to develop powerful smart 
clients that work with elaborate architectures to fulfill the promise of a truly 
connected world. Topics discussed include: Visual Studio 2005 & 2008, .NET 
Compact Framework 2.0 & 3.5, Windows CE, Windows Mobile 5.0 & 6.0 SDKs, 
Tablet PC SDK, SQL Server 2005 Compact Edition, the Mobile Client Software 
Factory , Tablet PCs, UMPCs, Pocket PCs, Smartphones, wireless networks 
(GSM/GPRS, EDGE, UMTS/HSDPA, СОМАЛХ, EV-DO, WI-FI, Bluetooth, etc.) and more. 
All this great content is combined with great demonstrations, basic & advanced 
coding techniques, enterprise mobility scenarios and actual live devices, all 
mixed up in true "ActiveNick" style. If mobile development interests you and 
you've got some .NET Framework programming experience, this is the one 
workshop you cannot afford to miss! This is the best way to kick-off Mobile 
Connections and help steer your experience over the next three days. 


PRE-CONFERENCE WORKSHOP > 9AM-4PM * SHAREPOINT WORKSHOP 
SHAREPOINT SERVER 2007 INSTALLATION AND DEPLOYMENT 
BEN CURRY 


This session will begin with a logical architecture of the SharePoint Server 2007 
product. This logical architecture will include a review of the dependencies, 
such as Windows Server, IIS, SQL Server, and WSS v3. There are several installa- 
tion options-and they do not mean what you think they mean! So, there are 
some best practices and lessons learned about the installation process. 


In the first deployment session, | will discuss how to add multiple Web Front 
End (WFE) Servers to a farm, in addition to defining what a farm is and its 
boundaries. The next section will discuss how to scale Search and Indexing. 
This is a two-part section: the first is how to implement the crawling topology 
and configure farm search operations. During the second, | will demonstrate 
how to scale the Query topology to support enterprise implementations. 

To successfully scale search indexing, you must create NetBIOS shares, 
understand how the shadow indexes are propagated, and know how to use 
Perfmon.exe to monitor your usage. Proper usage of perfmon.exe is critical to 
the long-term success of your enterprise search and indexing infrastructure. 
Excel Calculation Services (ECS) scaling is not difficult, but you must under- 
stand it to scale it out correctly. This section will discuss the ECS Web Proxy, 
ECS Web Service, and ECS Services themselves. A discussion here about co- 
hosting with Indexing and Query services is a must. 

You can also load balance Document Conversion Services and | will show you 
how and also demonstrate some best practices. 

Lastly, | will detail and demonstrate the best practices when building an 
Enterprise farm for Web Publishing and sharing content to the Internet. This 
section will include extending and mapping Web Applications, creating multi- 
ple Zones, and implementing Web Application Security Policies. 


NOTE: LUNCH IS INCLUDED WITH FULL DAY WORKSHOPS • THE COST OF A WORKSHOP IS IN ADDITION TO THE REGULAR CONFERENCE FEE 
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POST-CONFERENCE WORKSHOP * 9AM-4PM + SQL SERVER WORKSHOP 
SPS301: A DAY OF SQL SERVER 2005 SECURITY 

DON KIELY 

There are few corporate assets as valuable in the information age as data. 
Enterprises spend billions to collect and generate it, slice and dice it in every 
conceivable way to mine marketplace intelligence from it, and replicate and 
back it up using elaborate, redundant schemes. Yet it is all too common to 
slack on security. Sure, SQL Server 2005 is designed to be "secure by default," 
but once you add databases and start letting users and their applications 
access the server you have already poked holes in the security. SQL Server 
comes with plenty of features that let you secure data, but it can be hard to 
get a handle on the right ones to use in your environment. During this day of 
security, we'll explore myriad security features in SQL Server 2005, including 
granular permissions and how to design an effective authorization system, 
owners and schemas, and how they can help secure a database, the security 
issues and dangers with running SQL-CLR code, how to run T-SQL code in dif- 
ferent security contexts, the comprehensive encryption features that can pro- 
tect data, creating and enforcing password policies, how SQL Server protects 
catalog views and secures metadata, protecting against SQL injection attacks 
on the server, and more. You'll see lots of code and get lots of practical ideas 
for how to secure your database. Prerequisites: You'll need to have a good 
understanding of the basic database features and functions of SQL Server for 
this workshop, and it helps to have butted heads with SQL Server a time or 
two trying to get something to work without completely disabling security. 


POST-CONFERENCE WORKSHOP > 9AM-4PM • SQL SERVER WORKSHOP 


SPS302: SQL SERVER-PUT YOUR KNOWLEDGE INTO ACTION 
(BRING YOUR OWN LAPTOP) 

KIMBERLY L. TRIPP AND PAUL S. RANDAL 

After a week of learning and watching demos-spend your last conference day 
putting your knowledge into action and diving deeper into the implementation 
details. Bring your own laptop to install our VPC environment setup with hands- 
on lab exercises to walk you through some of our most important features in 
Database Maintenance and Disaster Recovery. AII labs will be ILLs (instructor-led 
labs) with supporting hands-on lab content *and* you will walk away with your 
own copy of the DVD to continue the exercises back at your office. You can 
attend without a laptop but your experience will be significantly better with one! 
This is meant as an advanced workshop and will expect a reasonable laptop configuration in 
order to participate: * Virtual Server or Virtual P-—already installed * At least 1 GB of physical 
memory w/512 MB dedicated to the VPC environment (2 GB is preferred w/1 GB dedicated to 
VPC) * 12 GB of physical disk space (20-+ GB is preferred) * DVD Drive 


POST-CONFERENCE WORKSHOP > 9АМ-І2РМ • DATA ACCESS WORKSHOP 
VPS201: ADO.NET 3.5 DATA ACCESS GUIDANCE 

JULIA LERMAN 

ADO.NET 3.5's Entity Framework presents us with a slew of new options for 
accessing data which you will learn about during the DevConnections confer- 
ence. This 1/2 day post-conference session will compare and contrast Object 
Services and Entity Client; Entity SQL, LINQ to Entities, and LINQ to SQL pro- 
viding guidance as to which methods are appropriate in particular scenarios. 
We will also examine this functionality with respect to both creating new 
solutions and plugging the Entity Framework into existing applications. 


POST-CONFERENCE WORKSHOP > 9AM-4PM • VISUAL STUDIO WORKSHOP 
VPS401: ADVANCED WCF 
JUVAL LOWY 


Tired of people explaining to you again what an endpoint is? Read every 
article and found yourself needing more? Had enough of fluffy talks? This 


day is packed solid with advanced topics, often shedding light on poorly 
understood or poorly documented aspects of WCF. You will learn not only 
advanced WCF techniques, but also relevant design guidelines, best prac- 
tices, and pitfalls. The material presented includes Juval's original tech- 
niques and utilities and goes well beyond anything you can find in conven- 
tional sources. Topics discussed include service contracts factoring and 
design, publish-subscribe frameworks, error handling extensions, transac- 
tion propagation modes, service and callbacks synchronization context 
and UI thread management, queued response service, and a declarative 
security framework. Don't miss out on this unique opportunity to master 
WCF with Juval Lowy who has been part of the strategic design effort for 
WCF from the beginning and who offers a profound insight on the technol- 
ogy and its applications. 


POST-CONFERENCE WORKSHOP > 9AM-4PM > ASP.NET WORKSHOP 
APS301: CHALLENGING ASP.NET FOR BRAVEHEARTED DEVELOPERS 
DINO ESPOSITO 


A deep understanding of the internal mechanics of ASP.NET pages may lead 
you to do virtually anything. Or, at least, anything that is not strictly prohibit- 
ed by the internal architecture of the run-time environment. In this workshop 
for bravehearted ASP.NET developers, you'll take up some of these chal- 
lenges. For example, you'll explore ways to modify Web sites without touching 
the source. Useful to inject tracking code in a remote site, the technique 
requires an advanced familiarity with HTTP modules and handlers. You'll also 
see how to catch undocumented run-time events, dissect protected overrid- 
able methods of system controls, and replace declaratively controls in any 
page of a Web site. 


POST-CONFERENCE WORKSHOP > 9AM-4PM + SHAREPOINT WORKSHOP 
HANDS ON TRAINING-SHAREPOINT DEVELOPMENT BOOTCAMP 
(BRING YOUR OWN LAPTOP) 

TODD S. BAGINSKI 

During this post-conference workshop I'll present/teach modules from my 
SharePoint Development Bootcamp. Each module has a short slide presenta- 
tion, followed by a quick demo, and then a lab exercise. 

Attendees will be provided with handouts for all of the modules covered. 
Each handout includes the bullet points in the slides, complete with detailed 
notes, explanations, and screenshots related to each bullet point on the 
slide. Each handout also includes click-by-click lab instructions that describe 
how to accomplish the lab. All code snippets and materials needed to com- 
plete each lab are also provided to the students. 

This workshop will include modules from my bootcamp course, including: 


SharePoint Framework Technical 
Roadmap 

Creating a Custom Site Definition 
How to Diagnose SharePoint Errors 
Advanced ONET.XML 

Advanced SCHEMA.XML 

Creating Custom Master Pages 
Extending the Administration Pages 
Creating Features and Consuming 
their Associated Events 

Features in the ONET.XML 
SharePoint Object Model Introduction 
Working with SharePoint Sites 
Programmatically 
Programmatically Customize Site 
Navigation 

Web Part Development 


Debugging Web Parts 

How to Deploy Code and Resources 
Building the Better Web Part 

Web Part Properties, ToolPanes, 
EditorParts and Menus 

Custom Server Controls in WSS Sites 
WSS Web Services 

Site Properties 

Using Impersonation in Custom Code 
mplementing Forms Based 
Authentication 


* Taking Advantage of the VSeWSS to 


Simplify Development Efforts 
mplementing WWF Workflows created 
in VS 2005 

Getting the Most Out of the Business 
Data Catalog 
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TAX DEDUCTION 

Your attendance to a DevConnections confer- 
ence may be tax deductible. Visit 
www.irs.ustreas.gov. Look for topic 513 - 
Educational Expenses. You may be able to 
deduct the conference fee if you undertake to 
(1) maintain or improve skills required in your 
present job; (2) fulfill an employment condition 
mandated by your employer to keep your 
salary, status, or job. 


GROUP DISCOUNT 
Register individuals from one company at the 


same time and receive a group discount. 
1-3 registrants $1,395 per person 


Additional registrants | $1,195 per person 
after the 3rd ($200 off each) 
(4th, 5th, 6th...) 


Call 800-438-6720 to take advantage of group 
discount pricing. 


LAS VEGAS, NEVADA 


Network with your colleagues at Mandalay Bay 
Resort & Casino! There's so much to do, you'll never 
have to leave this 4-star resort! 


* 11-асге tropical lagoon 

* Sandy beach 

* 3/4 mile lazy river 

* 30,000 sq.ft. luxury spa and fitness center 

* 16 restaurants on site, including The House of Blues 
* 135,000 sq.ft. casino 

* 12,000 seat sports/entertainment complex 

* Shark Reef: Not your typical aquarium! 


HOTEL ACCOMMODATIONS 

Mandalay Bay Resort and Casino, 3950 Las Vegas Blvd. South, Las Vegas, Nevada, 
is the conference site and host hotel. SPACE IS LIMITED so reserve your room early 
by calling the conference hotline at 800-438-6720 or 203-268-3204. 


* NOTE: ROOMS AT MANDALAY BAY HAVE BEEN TOTALLY REMODELED, VERY COOL! 
SPACE IS LIMITED * LAST YEAR ROOMS SOLD OUT EARLY SO BOOK YOUR ROOM TODAY! 


AIRLINE 
Please call Pericas Travel at 203-562-6668 for airline reservations. 


CAR RENTAL 
Hertz is offering auto rental discounts to attendees. Call the Hertz Meeting Desk 


at 800-654-2240 for reservations and refer to code CV# 01080032 to receive 
your attendee discount. 


ATTIRE 
The recommended dress for the conference is casual and comfortable. Please bring along 
a sweater or jacket, as the ballrooms can get cool with the hotel's air conditioning. 


SPONSORSHIP/EXHIBIT INFORMATION 
For sponsorship information, contact Rod Dunlap 480-917-3527 phone 
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Join the revolution with this 
step-by-step guide to integrating 
Reporting Services and MOSS 


SSRS. You can run one instance in native mode, 
keeping these features available in your environ- 
ment, and the other instance in integrated mode, 
letting you take advantage of the MOSS features 
that enable better information access. These two 
instances can even coexist on the same server, 
although you might achieve better performance 
by installing them on separate servers. 


Inside the Architecture 

Before installing the components required for 
SSRS-MOSS integration, you need to under- 
stand the difference between a single-server 
deployment and a distributed deployment so 
that you can select the one best suited for your 
environment. 

Single-server deployment. 'The simplest inte- 
grated architecture includes all components on 
one server, as Figure 1 shows. This approach 
works when you have a limited number of users 
and reports. However, SSRS alone requires a lot 
of memory and will compete for memory with 
SQL Server and MOSS. 

Let's review each component' role in the in- 
tegrated environment. Installing the Microsoft 
SQL Server 2005 Reporting Services Add-in 
for Microsoft SharePoint Technologies updates 
your MOSS installation to include a new SSRS 
proxy endpoint, a new Web Part for viewing in- 
tegrated-mode reports, and application pages for 
storing and managing integrated-mode reports. 

MOSS databases host the SSRS configura- 
tion information as well as SSRS content. Spe- 
cifically, the MOSS configuration database stores 
the information you provide when you configure 
SSRS for the MOSS instance. The content da- 
tabase is the primary storage location for report 
server items such as report definition files, report 
models, and data sources as well as for report 
server item properties. In addition, the MOSS 
content database stores security permissions for 
report server resources. 

Installing SQL Server 2005 SP2 on your re- 
port server adds a new Simple Object Access 
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Protocol (SOAP) endpoint for integrated re- 
port management and new extensions that let 
MOSS manage security and enable subscription 
delivery to а MOSS document library. SP2 also 
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modifies the Reporting Services Configuration 
Tool to support MOSS integration and to gener- 
ate database scripts for integrated-mode report 
server databases. 

The report server uses the same two databases 
that it uses in native mode but with a slight vari- 
ation. The report server database is the primary 
storage location for report snapshots, report his- 
tory, schedules, and subscriptions. In ad- 
dition, this database serves as secondary 
storage for the report definition files, re- 
port models, data sources, and their prop- 
erties, avoiding repeated transmission of 
files to the report server for processing. As 
in native mode, the report server temporary da- 
tabase stores session data and cached instances. 

When you add, modify, or execute a report on 
the MOSS server, the report server first compares 
the item and its properties from the MOSS site 


Figure 1 


Single-Server deployment 


ORE on the WEB 


Read the Web-exclusive sidebars 
at InstantDoc IDs 96841 and 
96842.. 
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TABLE 1: SSRS Service Account Requirements 


Deployment 
Single-server 


Remote SQL Server with report server on 
same server as SharePoint databases 


Remote SQL Server with report server on 
separate server from SharePoint databases 


SharePoint standalone 
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TABLE 2: Privileges Required for Integration 


Service Accounts Allowed Configuration Tasks 
Local system Task Privileges Required 
Install SP2 Local administrator 


Network service 
Domain user account 


Domain user account 


Network service Web application 


Domain user account 


Configure the report server 
Install the SSRS add-in on the MOSS server 
Configure SSRS integration on the MOSS 


Create and modify MOSS security policies 


Local administrator 
MOSS farm administrator 


MOSS farm administrator and 
site collection administrator 


MOSS site owner 


for report server content and operations 


Network service 
Domain user account 


with the corresponding copy (if one exists) in the 
report server. If the report in MOSS is new or has 
a later timestamp, a synchronization process cop- 
ies the new version to the report server database. 

Even in integrated mode, the report server 
performs all data processing, rendering, sub- 
scription delivery, and snapshot and history 
generation. The main difference between native 
and integrated mode is the storage location for 
report server content and properties. 

Distributed deployment. Distributed deploy- 
ments come in many variations, but I limit my 
review to two approaches that illustrate key con- 
figuration points. You can set up MOSS on one 
server and store its configuration and content 
databases on a remote server. I call this the re- 
mote SQL Server deployment method and as- 
sume that SSRS also is installed on the remote 
server. Or, you can install all MOSS compo- 
nents, including the databases, on one server and 
install SSRS on a second server. I call this the 
MOSS stand-alone deployment method. 

Alternatively, you can create a MOSS farm to 
support multiple Web front-end servers. SSRS 
integrates with the farm itself, not with any par- 
ticular MOSS server in the farm, and uses only 
one report server database for the farm. If the 
MOSS farm has multiple Web applications, each 
Web application can use different authentication 
providers. For example, one Web application 
could use NTLM authentication while another 
uses Kerberos, and the report server would in- 
teract with each MOSS server by using the ap- 
propriate authentication mode. 


Installation Prerequisites 

Because this article is about SSRS-MOSS in- 
tegration, I assume that you have the report 
server installed correctly and, if you're plan- 
ning a distributed deployment, that you have 
MOSS installed on another server. The first 
step to installing the integration components 


is to download and install on the report server 
the Microsoft . NET Framework 3.0 Redistrib- 
utable Package, available at http://go.microsoft 
.com/fwlink/?linkid=72322&clcid=0x409. If you 
haven't already, you also need to download SQL 
Server 2005 SP2 from http://go.microsoft.com/ 
fwlink/?linkid=82758. Then you need to down- 


load the Microsoft SQL Server 2005 Reporting 


Services Add-in for Microsoft SharePoint Tech- 
nologies from http://go.microsoft.com/fwlink/ 
?linkid-82753. 


Using the Reporting Services Configuration 


Tool, which you launch after installing SP2, you 
must configure a service account for the SSRS 
Windows service and its Web service. The service 
accounts will later require write and execute per- 
missions on the MOSS content database to let the 
report server add, change, or delete report server 
items and properties. These permissions are as- 
signed when you configure the MOSS Web appli- 
cation for SSRS integration. But for now, make 
sure you select the correct type of service account 
for your environment, as Table 1 shows. 

If you plan to use domain user accounts as the 
SSRS service accounts, create them now. You also 
need to know the username and password for the 
account that MOSS uses to connect to its data- 
bases. In addition, make sure you have the nec- 
essary privileges, or schedule time with a person 
who has them, before performing the five integra- 
tion configuration tasks that Table 2 describes. 


Installing and Configuring MOSS 
on the Report Server 

You start the integration setup by installing SP2 
on the report server. Next, install the Windows 
SharePoint Services (WSS) object model on the 
same server, following the appropriate installa- 
tion steps for your deployment mode, as follows. 
(For information about using WSS or MOSS 
Standard Edition instead of MOSS Enterprise 
Edition, see the Web sidebar “What Can I Ac- 
complish with Other SharePoint Technologies?” 
InstantDoc ID 96841.) 
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Single-server. In a single-server deployment, 
the object model installs when you perform 
an Advanced installation of Microsoft Office 
SharePoint Server 2007 Enterprise Edition and 
select the Complete server type. When installa- 
tion is complete, reboot if prompted, and run the 
SharePoint Products and Technologies Configu- 
ration Wizard, which you find in the Microsoft 
Office Server program group. On the wizard's 
Connect to a Server Farm page, select No, I want 
to create a new server farm. Type the name of 
your SQL Server system, and provide a domain 
account for MOSS database access. Then select 
an authentication provider—NTLM or Kerbe- 
ros—and click Finish. 

When the wizard completes, the SharePoint 
Central Administration page opens. To com- 
plete a basic installation, first start the Office 
SharePoint Server Search service to support in- 
dexing and searching of MOSS content. Click 
the Operations tab, click Services on server in the 
Topology and Services section, and click Start 
next to Office SharePoint Server Search. On the 
following configuration page, select the Use this 
server for indexing content and Use this server for 
serving search queries check boxes. You must also 
provide an email address for an administrator to 
be contacted if indexing problems occur and a 
domain account and password to run the search 
service, then click Start to start the service. 

Next, configure a new Web application to 
create the SharePoint site on your Microsoft IIS 
server. To do this, in SharePoint Central Admin- 
istration, click the Application Management 
tab, click Create or extend Web application in the 
SharePoint Web Application Management sec- 
tion, then click Create a new Web application. On 
the next page, keep the default settings, changing 
the authentication provider to Kerberos if your 
deployment requires it instead of NTLM. You 
must also specify a domain account or select 
a built-in account for the application pool as- 
signed to this Web application. If you have a re- 
mote SQL Server deployment, be sure to change 
the database server name. The MOSS content 
and configuration databases will be created on 
the specified server. Click OK. 

The last configuration step in this deployment 
model requires you to create a site collection for 
the Web application you created. A site collec- 
tion is essentially a Web page that displays a tab 
for each MOSS site you create in that collection. 
Each site has one or more Web pages that you can 
use to organize content to support a specific goal, 
such as storing reports in a library or managing 
collaborative activities. On the Application Man- 


agement page, in the SharePoint Site Manage- 
ment section, click Create site collection. Provide 
a title for the site collection, and set the URL for 
the Web Site Address to http://your_server/sites/ 
Reports, for example. In the Template Selection 
section, click the Enterprise tab, and select Report 
Center. This site template adds to your MOSS 
site a set of predefined Web pages that you'll use 
later to manage reports and data sources. Spe- 
cifically, a Report Center site contains the docu- 
ment libraries and other business intelligence (BI) 
features that make information integration easy. 
Type the domain account of the user who will be 
the primary administrator of the site collection, 
and optionally provide a domain account for a 
secondary site collection administrator. Click OK. 
to create the site collection. 

Distributed servers. For a distributed deploy- 
ment, the WSS object model installs when you 
run the MOSS Web front-end setup on the re- 
port server. Note that you can't mix SharePoint 
technologies, which means you can't install WSS 
on your report server and then integrate SSRS 
with a MOSS server or vice versa. 

Start the MOSS setup program on the report 
server, choose the Advanced installation option, 
and then select the Web front end option. When 
the installation completes, reboot if prompted, 
and then run the SharePoint Products and Tech- 
nologies Configuration Wizard. On the Connect 
to a Server Farm page, select Yes, I want to con- 
nect to an existing server farm. On the Specify 
Configuration Database Settings page, type the 
name of the database server, click Retrieve Data- 
base Names, and select the MOSS configuration 
database, which by default is SharePoint. Con- 
fig. Provide the password for the account that the 
Web front end uses to connect to this database, 
select either NTLM or Kerberos authentication 
on the next wizard page, and then click Finish. 
Finish the distributed deployment installation 
by configuring a new Web application and site 
collection as described in the previous section. 

Reporting Services configuration. If your 
MOSS Web application uses the default port 80, 
which is the case for a default MOSS configura- 
tion, you must reassign the Web site that's hosting 
the Report Server Virtual Directory to an unused 
port, such as 8080, and be sure that Web site 1s 
started in IIS. To reassign the report server's Web 
site port, open Internet Information Services 
(IIS) Manager in the Administrative Tools pro- 
gram group, expand the server hosting SSRS, and 
expand the Web Sites folder. Right-click the De- 
fault Web Site (or the Web site you've assigned 
the SSRS virtual directories to), and click Prop- 


SQL Server Magazine * www.sqlmag.com 


erties. In the Default Web Site Properties dialog 
box, on the Web Site tab, type the number of the 
unused port in the TCP port box. Keep the dialog 
box open for subsequent steps. 

Next, verify authentication on the Web site 
to ensure that anonymous access is disabled 
and that integrated Windows authentication is 
enabled. In the Default Web Site Properties dia- 
log box, click the Directory Security tab. In the 
Authentication and access control section, click 
the Edit button. In the Authentication Methods 
dialog box, clear the Enable anonymous access 
check box and select the Integrated Windows au- 
thentication check box. 

Last, revise the RSReportServer.config file 
(which you can find at Program Files\Microsoft 
SOL Server MSSQL.nWeporting | Services Re- 
portServer) by changing the URLRoot property 
to include the port number (e.g., http://your_serv- 
er:8080/reportserver); this provides the correct 
link to reports in email subscriptions. Before con- 
tinuing, restart IIS, and then test the URL in your 
browser to make sure SSRS works correctly. 


Setting Up Integrated Mode 

You're now ready to prepare the report server for 
integrated mode. First, use the Reporting Ser- 
vices Configuration Tool to change the Windows 
Service Identity and Web Service Identity to use 
a domain user account if the MOSS databases 
are on the same machine as the report server in 
a remote SQL Server deployment. Start the Re- 
porting Services Configuration Tool by clicking 
Start, pointing to All Programs, Microsoft SQL 
Server 2005, Configuration Tools, and clicking 
Reporting Services Configuration. Select the cor- 
rect SSRS instance by specifying the report serv- 
er name and selecting the SSRS instance when 
prompted, then click Connect. Click Windows 
Service Identity in the left pane of the configura- 
tion tool’s window, then select Windows account, 
provide the domain username and password cre- 
ated as explained in the earlier Installation Pre- 
requisities section, and click Apply. 

To change the Web Service Identity, you 
use ITS Manager, but first confirm the applica- 
tion pool that SSRS is using. Click Web Service 
Identity in the left pane of the Reporting Ser- 
vices Configuration Tool, and in the drop-down 
list next to Report Server, note the name of the 
application pool. Don’t worry about the Report 
Manager application pool’s identity because you 
won't use it after you switch the report server to 
integrated mode. Keep the configuration tool 
open, and switch to IIS Manager. Expand the 
server hosting SSRS, expand the Application 
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Pools folder, right-click the application pool you 
identified for Report Server in the configuration 
tool, click Properties, and click the Identity tab. 
Select Configurable, provide the domain user- 
name and password, and click OK. Remember 
that this step 1s necessary only for a remote SQL 
Server deployment. 

To convert the report server to integrated 
mode, switch back to the Reporting Services 
Configuration Tool, click Database Setup in the 
left pane, connect to the database server, then 
click Change. Click Yes to confirm the request 
to change the report server mode. In the result- 
ing SQL Server Connection dialog box, provide 
a new report server database name, confirm that 
the Create the report server database in Share 
Point Integrated mode check box is selected, as 
Figure 2 shows, and click OK. At the bottom of 
the Database Setup 
page, click Apply, 
and then click OK 
to confirm the cre- 
dentials used to fi- 
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Configuration Tool 
is now disabled be- 
cause the report 
server is in integrat- 
ed mode. Also notice 
the red status icon 
next to the Share- 
Point Integration link in the left pane. This icon 
means SharePoint Integration is not configured. 
The status will change to Configured (a green 
icon) after you configure the MOSS server for in- 
tegration, which I cover next. 

Before closing the Reporting Services Config- 
uration Tool, make sure you back up the encryp- 
tion key. You'll need the encryption key backup 
if you ever need to restore the report server da- 
tabases. Without this backup, after restoring the 
report server databases, you must delete encrypt- 
ed data such as data sources or user information 
because there would be no way to decrypt it. To 
create the backup, click Encryption Keys in the 
left pane, click Backup, and type a password. 
Click the ellipsis button to the right of the Key 
File box, and then in the Save As dialog box, 
navigate to a secure folder or removable media 
location, provide a file name, and click Save. 

A common question from report server ad- 
ministrators planning to convert an existing 
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Figure 3 


Managing integration 
settings 


SSRS server from native to integrated mode 15 
how to migrate the existing report server con- 
tents. Unfortunately, there's no migration util- 
ity to transfer the contents of the native mode 
report server database to the integrated mode 
version of the database. Instead, you have to re- 
deploy existing reports to the MOSS server and 
reconfigure execution and security settings of 
each report in MOSS. 


Installing and Configuring the 
SSRS Add-in on the MOSS Server 
Now, you're ready to install the SSRS add-in on 
the MOSS server. If you have a MOSS farm, you 
must install the add-in on all Web front-end serv- 
ers in the farm except the report server. When 
you've updated all the servers, you can configure 
SSRS integration in MOSS. Remember from Ta- 
ble 2 that you must use a domain user account that 
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has MOSS farm administrator and site collection 
administrator privileges to perform this task. To 
begin the SSRS configuration, click Start, point 
to All Programs, point to Microsoft Office Server, 
and click SharePoint 3.0 Central Administration. 


Click the Application Management tab. The add- 
in has updated this page to include a Reporting 
Services section. Click Manage integration set- 
tings to specify the URL for the report server 
(e.g., http://your server:8080/reportserver) and 
set the authentication mode, as Figure 3 shows. 
Click OK when finished. (For information about 
which authentication mode to use, see the Web 
sidebar “The Differences Between Authentica- 
tion Modes,” InstantDoc ID 96842.) 

On the Application Management page, click 
Grant database access. Provide the report server 
name and instance in preparation for granting 
permissions to the SSRS Windows and Web 
services to access the MOSS databases. After 
you click OK, you're prompted for report server 
administrator credentials. If you have trouble 
here, you might need to reapply the settings on 
the Windows Service Identity and Web Service 
Identity pages of the Reporting Services Config- 
uration Tool. After you properly configure the 
service accounts, the status of SharePoint Inte- 
gration in the Reporting Services Configuration 
Tool should be Configured (green). 

Optionally on the Application Management 
page, you can click Set server defaults to provide 
default settings for report history and logging in 
the Reporting Services Server Defaults page, as 
Figure 4 shows. If an error displays when you 
try to open this page, make sure you've created a 
Web application and site collection. Also be sure 
you're logged in using a site collection admin- 
Istrator account. Finally, be aware that some- 
times the configuration changes you make in the 
Reporting Services section require a restart of 
IIS before you can open the Reporting Services 
Server Defaults page. 

Your next step is to configure Component 
Services with the correct permissions for the 
SSRS Web Service Identity to activate MOSS. 
On the MOSS server, click Start, point to Ad- 
ministrative Tools, click Component Services, 
and then expand Computers, My Computer, 
DCOM Config. Right-click IIS WAMREG 
Admin Service, and click Properties. On the 
Security tab, click the Edit button for Launch 
and Activate Permissions. Click Add and type 
the built-in account or domain user account you 
assigned to the Report Server application pool. 
Alternatively, you can click the Advanced but- 
ton to search for and select the account. After 
you add the account, highlight it in the Group or 
user names list in the Launch Permission dialog 
box, then select the Local Activation check box 
in the Allow column. Click OK twice to close all 
dialog boxes and save the permission changes. 
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SSRS AND MOSS OVER OR 


Last, update the document libraries in the Report Builder Report content type; the Report 
MOSS Web application to use the SSRS content Builder application displays a list of all models 
types. You can make these changes to any docu- іп the same library in which the user creates the 
ment library in your MOSS Web application, but new Report Builder Report. If the model is in a 
I limit my explanation to the Reports and Data different library, the user must click the default 


Connections libraries that are part of the stan- library link in the Getting Started pane of Re- 
dard Report Center site. In Internet Explorer рогі Builder, must know which MOSS library Editor's Note 
(IE), navigate to the Reports document library contains the desired model, and must navigate 


in your MOSS Web application. For example, if to that location. For important information 
the URL you assigned to the Web application about migrating reports 
is http://your_server/sites/Reports, go to http:// Successful Configuration from SSRS to MOSS, see 


your_server/sites/Reports/ReportsLibrary to open Ву following these steps, you have successfully (в Web-exclusive sidebar 
the Reports library. On the Settings menu, click installed and configured the components re- 
Document Library Settings. Inthe Content Types quired to integrate SSRS with MOSS Enterprise Me » 
| : pde SN . : ervices to MOSS” at 
section, click Add from existing site content types. Edition. (For more information about these InstantDoc 10 96975. It’s 
In the Available Site Content Types list, select Re- steps, see the Microsoft resources listed in the  . rmm 
port Builder Report, click Add, and click OK. Learning Path at InstantDoc ID 96840.) In the троп 90-001; that 
Report Center includes the Data Connections second article in this series, ГЇЇ show you how Many report properties 
library to restrict access to data source files and ќо perform familiar SSRS tasks, such as deploy- are not migrated from 
related management tasks. To add the SSRS con- ment, security implementation, and property SSRS and must be 
tent types for data sources, open the library, and management, іп the new MOSS environment. reconfigured in MOSS. 
repeat the steps you followed to add report con- ГЇЇ also show you how to take advantage of new 
tent types, except select Report Data Source. features enabled by MOSS, such as versioning, 
You can add the Report Builder Model con- workflow, alerts, information management poli- 
tent type to either the Reportslibrary orthe Data — cies, and how to achieve BI information integra- 
Connections library. I find it easier for users to tion with SSRS and MOSS. [SQL 
use a report model if it’s in the same library as the InstantDoc ID 96840 
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Grouping Sets, 


PART 1 


SQL Server 2008 introduces enticing 
functionality that you can learn now 


want to devote this article to a cool concept 
| that’s been around for a while but has gained 
some new functionality in SQL Server 2008. Рт 
talking about grouping sets, which are very useful for 
analyzing aggregated data. They let you provide data- 
analysis solutions against the relational database as 
alternatives to Analysis Services when appropriate. 
Now, I know many of you haven't even thought 
about checking out SQL Server 2008 yet, but I 
would urge that you do so. What better way to be 
prepared for the new version when it arrives? Check 
out the Web-exclusive sidebar “Getting Your Feet 
Wet in SQL Server 2008" (InstantDoc ID 96802) 
for information about downloading a Community 
Technology Preview (CTP) of SQL Server 2008. 
With that in mind, let's examine grouping sets. 
All the code samples in this series of articles about 
grouping sets will be against an Orders table created 
in the tempdb database. (Assuming you have the 
latest CTP version of SQL Server installed, you 
can run all the code samples that ГЇЇ provide.) Run 
the code in Listing 1 to create the Orders table and 


populate it with 
ORE on the WEB 


sample data. As 
Download the listings and see ап aside, those of 
the Web tables at InstantDoc you with a keen 
ID 96805, and read the Web- 


exclusive sidebar at InstantDoc ID 96802. У will notice 
something special 


about the INSERT statement in Listing 1—it utilizes 
a small but very handy new T-SQL feature in SQL 
Server 2008 called Row Value Constructors. That is, 
you can now insert multiple rows into a table with 
a single INSERT VALUES statement. You simply 
encapsulate each row’s set of values in parentheses 
and separate the rows with commas. 


Grouping Sets 
Whenever learning about a new feature, a good way 
to start is to understand what kind of problems that 
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feature helps you solve and how you solved such 
problems in the past. A grouping set isn’t a new 
concept—it's nothing more than the set of elements/ 
attributes that you group by. For example, the set of 
attributes in any traditional GROUP BY query is 
a grouping set. SQL Server 2008 introduces a new 
option/subclause called GROUPING SETS, which 
lets you define multiple grouping sets in the same 
query. Logically, a query with the GROUPING 
SETS option produces one result set as if unifying 
the result sets of multiple GROUP BY queries. It’s 
not that you couldn’t achieve the same thing in the 
past, but with the GROUPING SETS option, you 
can write much less code, your query will be much 
more efficient, and you'll have more flexibility than 
in the past. 

As an example, sup- 
pose you need to query 
the Orders table and 
return total quantity 
values for each (customer, 
employee and order year), 
and for each (customer 
and year), and for each 
(employee and year), and 


SET NOCOUNT ON; 
USE tempdb; 
GO 


DROP TABLE dbo.Order 
GO 
CREATE TABLE dbo.Order 
( 


IF OBJECT, ID('dbo.Orders', 


5; 


S 
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LISTING 1: Code to Create and 
Populate the Orders Table 


'U') IS NOT NULL 


orderid INT NOT NULL 

also for (the grand total). orderdate DATETIME NOT NULL, 
- ette. INT NOT NULL 

Yes, you can obtain all custid  VARCHAR(5) NOT NULL, 
result sets with four qty INT NOT NULL, 


CONSTRAINT PK Orders PRIMARY KEYCorderid) 


GROUP BY queries, as 
you see in Listing 2, page 
.26. But suppose you want 


›; 


INSERT INTO dbo.Orders 


: VALUES 
to unify all four result sets (30001, '20060802', 
: (10001, '20061224', 
into one. In other words, (10005, '20061224', 
you want one result set Е RU 

ifvi i (20001, '20070212', 
unifying four grouping CRT EC certe: 
sets: (customer, employee, саш annsna oi 

К . (30003, '20080418', 
year) (customer, year), (30004, '20060418'. 
(employee, year), and 0 120060907", 


О. One option is to use 


Corderid, orderdate, empid, 


3 
1 
1] 
4 
1 
2 
4 
2) 
3 
5 
3 


жы ANN SS SS SSS 


custid, qty) 


10), 
12), 
20), 
40), 
14), 
12), 
10), 
20), 
15), 
220] 
30); 
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LISTING 2: 4 GROUP BY Queries 


SELECT custid, empid, YEAR(orderdate) AS 
orderyear, SUM(qty) AS qty 

FROM dbo.Orders 

GROUP BY custid, empid, YEAR(orderdate); 


SELECT custid, YEARCorderdate) AS orderyear, 
SUM(qty) AS qty 

FROM dbo.Orders 

GROUP BY custid, YEARCorderdate); 


SELECT empid, YEARCorderdate) AS orderyear, 
SUM(qty) AS qty 

FROM dbo.Orders 

GROUP BY empid, YEARCorderdate); 


SELECT SUM(qty) AS qty 
FROM dbo.Orders; 


LISTING 3: Unifying 4 Grouping Sets 
with UNION ALL Б jus 


SELECT custid, empid, YEAR(orderdate) AS 
orderyear, SUM(qty) AS qty 

FROM dbo.Orders 

GROUP BY custid, empid, YEAR(Corderdate) 


UNION ALL 


SELECT custid, NULL AS empid, YEARCorderdate) 
AS orderyear, SUM(qty) AS qty 

FROM dbo.Orders 

GROUP BY custid, YEARCorderdate) 


UNION ALL 


SELECT NULL AS custid, empid, YEARCorderdate) 
AS orderyear, SUM(qty) AS qty 

FROM dbo.Orders 

GROUP BY empid, YEARCorderdate) 


UNION ALL 


SELECT NULL AS custid, NULL AS empid, NULL AS 
orderyear, SUM(qty) AS qty 
FROM dbo.Orders; 


LISTING 4: Unifying 4 
Grouping Sets with the 
GROUPING SETS Option 


SELECT custid, empid, YEARCorderdate) AS 
orderyear, SUM(qty) AS qty 

FROM dbo.Orders 

GROUP BY GROUPING SETS 

( 
€ custid, empid, YEARCorderdate) ), 


€ custid, YEAR(Corderdate) 217 
€ empid, YEARCorderdate) Dor 
1%) 

; 


UNION ALL set opera- 
tions between the sets. 
Because all four result 
sets originally have dif- 
ferent structures/schema, 
and set operations require 
that all input sets have the 
same schemas, you can use 
NULLs as placeholders 
for the missing elements. 
Listing 3 shows the code 
that unifies all four result 
sets. The problem with 
this solution is that the 
code is lengthy and is 
very inefficient because 
the base data (the rows in 
the Orders table) is going 
to be fully scanned four 
times. 

SQL Server 2008 
lets you write much 
shorter and more effi- 
cient code by using the 
GROUPING SETS 
option in the GROUP 
BY clause, where you 
simply list your desired 
grouping sets. Listing 4 
shows the new, shorter, 
and more efficient alter- 
native to Listing 3’s 
solution. Web Table 1 
(http://www.sqlmag 
.com, InstantDoc ID 
96805) shows the output 
of the query in Listing 4. 

Note the use of 
empty parentheses, sig- 
nifying a grouping set 
with no attributes. As an 
aside, I never felt quite 
comfortable with the fact 
that when I wanted to 
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aggregate the entire input set of rows, I had to omit 
the GROUP BY clause altogether. For example, in 
the past, when you wanted to return the grand total 
quantity, you wrote a query such 


SELECT SUM(qty) AS totalqty 
FROM dbo.Orders; 


It always felt a bit awkward to call such a query a 
GROUP BY query; it's а GROUP BY query, but 
the GROUP BY clause is implicit. As a byproduct 
of adding native support for multiple grouping sets, 
SQL Server 2008 lets you write the following explicit 


alternative to the above query: 


SELECT SUM(qty) AS totalqty 
FROM dbo.Orders 
GROUP BY О); 


Back to the query in Listing 4 using the GROUPING 
SETS option, it’s evident that the code is much 
shorter than the code in Listing 3, with the UNION 
ALL set operations. As for efficiency, the beauty of 
the SQL Server 2008 solution is that the base data 
(the rows in the Orders table) needs to be scanned 
fewer times, because SQL Server can calculate 
higher-level aggregates on top of lower-level aggre- 
gates. ГЇЇ leave you with a full examination of the 
execution plans for both solutions as an exercise 
because I find them to be clear and straightforward. 
But I want to highlight a couple of key points. 

In the plan for the code in Listing 3, you'll find 
four branches (one for each GROUP BY query). 
Each branch starts with a clustered index scan, fol- 
lowed by one aggregate operator operating on the 
whole input set. (There will also be either sorting 
or hashing to support the aggregate operation.) In 
the plan for the code in Listing 4, you'll find fewer 
clustered index scans (two in this case), and in some 
(or all) branches, you'll find multiple aggregate 
operators, where the non-first aggregate operator in 
the brunch aggregates the pre-aggregated data. If 
you compare the costs of the two plans, you'll find 
that the one for the code in Listing 4 is substantially 
cheaper than the one for the code Listing 3. 


I vs. Multiple Grouping Sets 
Every casual GROUP BY query can be thought of 
as having a single grouping set. For example, the fol- 
lowing two queries are logically equivalent: 


-- Query 1 

SELECT custid, empid, YEARCorderdate) 
AS orderyear, SUM(qty) AS qty 

FROM dbo.Orders 

GROUP BY custid, empid, 
YEARCorderdate); 


-- Query 2 

SELECT custid, empid, YEARCorderdate) 
AS orderyear, SUM(qty) AS qty 

FROM dbo.Orders 

GROUP BY GROUPING 5ЕТ5( (custid, 
empid, YEARCorderdate)) ); 


Note that you shouldn't confuse the use of 
GROUPING SETS in Query 2 with the following: 


-- Query 3 

SELECT custid, empid, YEARCorderdate) 
AS orderyear, SUM(qty) AS qty 

FROM dbo.Orders 
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GROUP BY GROUPING 5ЕТ5( custid, empid, 
YEARCorderdate) ); 


Syntactically, all I did in Query 3 was remove 
one layer of parentheses; however, logically, I 
defined three grouping sets: (custid), (empid), and 
(YEAR(orderdate)) as opposed to one in Query 2: 
(custid, empid, Y EAR(orderdate) ). As I mentioned 
earlier, Query 2 is logically equivalent to Query 1 (a 
single traditional GROUP BY query), whereas Query 
3 is logically equivalent to UNION ALL operations 
between three traditional GROUP BY queries: 


SELECT .... 
UNION ALL 
SELECT ... 
UNION ALL 
SELECT .... 


GROUP BY custid 
GROUP BY empid 


GROUP BY YEAR(Corderdate); 


Run the three queries above and see for yourself. 
Query 1 and Query 2 produce the output that 
Web Table 2 shows, and Query 3 produces the 
output that Web Table 3 shows. 

In other words, within the parentheses of the 
GROUPING SETS option, commas are used as a 
separator between grouping sets. Within a particular 
grouping set, commas are used as a separator between 
the grouping set's elements or attributes. For clarity, 
you might want to use parentheses surrounding each 
grouping set even when the grouping set contains a 
single element. For example, the following 1s logically 
equivalent to Query 3: 


SELECT custid, empid, YEARCorderdate) 
AS orderyear, SUM(qty) AS qty 
FROM dbo.Orders 
GROUP BY GROUPING SETS 
( 
( custid dy 
( empid 9% 
( YEARCorderdate) ) 
; 


Mixing Grouping Sets and 
Grouping Sets Algebra 

Youcan have multiple GROUPING SETS subclauses 
in the same GROUP BY clause. If you specify mul- 
tiple ones, something interesting will happen: You'll 
get a Cartesian product or a cross of the grouping 
sets in the various GROUPING SETS subclauses. 
This probably sounds very cryptic, and would be 
best explained through examples. Given grouping sets 
(A), (B), (C), (D), and (E), where each capital letter 


GROUPING SETS 


GROUPING SETSC (A), (В), (С) D, 
GROUPING 5ЕТ5( (р), (E) ) 


is logically equivalent to 


GROUP BY 
GROUPING 5ЕТ5( (A, D), (B, D), (С, 
D, (A, D, (B, D, (C, E) ) 


How about a more tangible example? The code in 
Listing 5 has two GROUPING SETS subclauses. 
The first GROUPING SETS subclause has three 
grouping sets: 


GROUPING SETS 


( 
( custid, empid 2, 
( custid Уу; 
( empid ) 
) 


The second GROUPING SETS subclause has two 
grouping sets: 


GROUPING SETS 
( 
( YEARCorderdate), 
MONTH(Corderdate) ), 
( YEARCorderdate) 


m 
» 


LISTING 5: Mixing Grouping Sets 


) 
) 
You get six grouping sets sELECT. 
as a result of the Car- eee 


tesian product between 
the three and the two. 

Listing 6 shows the GROUP BY 
logical equivalent to the Pro PENS ASETE 
code in Listing 5. All six 


SUM(qty) AS qty 
FROM dbo.Orders 


€ custid, empid ), 


p В С custid 9 
grouping sets are speci- ( empid ) 
fied іп one GROUPING ОЕА 
SETS subclause. е 

Finally, if all grouping ( YEARCorderdate) 


; 


LISTING 6: Logical Equivalent of the 
Code in Listing 5 


SELECT 
custid, 
empid, 
YEARCorderdate) AS orderyear, 
MONTHCorderdate) AS ordermonth, 
SUM(qty) AS qty 
FROM dbo.Orders 
GROUP BY 
GROUPING SETS 
( 


YEARCorderdate) AS orderyear, 
MONTHCorderdate) AS ordermonth, 


€ YEARCorderdate), MONTH(Corderdate) 2, 
) 


represents a set of elements in the grouping set (e.g., custid, empid, YEAR(orderdate), MONTH(orderdate) 


( Du 
С custid, empid, YEARCorderdate) ) 
A stands for the set of elements al, a2, 2283 ап), the ‘ custid, 2 оаа ооу MONTHCorderdate) 
1 . custi orderdate 
following pseudo GROUP BY clause: ( empid, YEARCorderdate), MONTH(Corderdate) )2 
€ empid, YEARCorderdate) ) 


GROUP BY Ур 
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LISTING 7: Pulling Out 


Common Elements 


SELECT 
custid, 
empid, 
YEARCorderdate) AS orde 
MONTHCorderdate) AS ord 
SUM(qty) AS qty 
FROM dbo.Orders 
GROUP BY 
YEARCorderdate), 
GROUPING SETS 
( 
€ custid, empid ), 
€ custid da 
€ empid ) 


GROUPING SETS 
( 
€ MONTHCorderdate) ), 


O 
2; 
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GROUPING SETS 


sets have a common element, you can pull them out- 
side all GROUPING SETS subclauses into their 
own GROUPING SETS subclause, and by doing 
so, shorten your code. For example, consider the 
following pseudo GROUP BY clause: 


GROUP BY GROUPING SETS( (a, b, c, d), 
(a, b, с), (a, b, д, (a, b) ) 


All grouping sets have the common elements “a, 
b,"—hence, this can also be expressed as 


GROUP BY GROUPING SETS ( (a, b) ), 
GROUPING SETSC (c, 4), CO, (D, © ) 


Because GROUP BY GROUPING SETS ( (a, b) ) 
is logically equivalent to GROUP BY a, b, you can 
also express this as 


GROUP BY а, b, GROUPING 5ЕТ5( (с, 4), 
(с), (4), О) 


As а more tangible example, the queries in List- 
ings 5 and 6 are logically equivalent to the query in 
Listing 7; the common element YEAR(orderdate) 
was pulled out. 


What's Next? 

If grouping sets seem to be somewhat familiar, it's 
not by chance. You might have already realized that 
the GROUPING SETS option is similar to the 
CUBE and ROLLUP options in previous versions 
of SQL Server. In fact, you can consider CUBE 
and ROLLUP as special cases of the GROUPING 
SETS option. However, 
the CUBE and ROLLUP 
options were implemented 
in a rigid/non-flexible way 
prior to SQL Server 2008 
and are also non-ISO com- 
pliant. SQL Server 2008 
introduces new, ISO-com- 
pliant, more flexible CUBE 
and ROLLUP options to 
replace the older ones. In 
a later article, I'll tell you 
all about the new CUBE 
and ROLLUP options, 
and also about the new 
GROUPING ID function 
that you'll probably find useful in conjunction with 
grouping sets. 

In future articles, ГЇЇ cover other T-SQL enhance- 
ments in SQL Server 2008, including the new 
MERGE statement, Table-Valued Parameters, the 
HEIRARCHYID datatype, and temporal datatype 
enhancements (DATE, TIME, DATETIME2, 
DATETIMEOFFSET datatypes). Ед 

InstantDoc ID 96805 
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THE LOGICAL PUZZLE 


SEPTEMBER'S PUZZLE: 
PROBABILITIES IN CHINA 

Is it possible to prove statistically that there must be 
at least two people in China who have the same num- 
ber of hairs on their heads? Try to stick to pure prob- 
ability and not to assumptions such as, "There must 
be many bald people in China." Also, is it possible to 
prove statistically that there must be at least two peo- 
ple in China with the same arrangement of teeth (i.e., 
missing or existing in the same positions)? Again, try 
to stick to pure probability and not to assumptions 
such as, "There must be many old people with no 
teeth, or people with no missing teeth." (I got these 
two nice puzzles from my friend, SQL Server MVP 
Marcello Poletti.) 


The answer to the first puzzle is yes. There are 
more than a billion people in China, and there are 
fewer than a billion hairs on a human head. Because 
there are fewer hairs on a human head than there are 
people in China, it's impossible that every person in 
China has a unique number of hairs. Therefore, there 
must be at least one number that occurs at least 
twice; in other words, there must be at least one set 
of at least two people in China with the same number 
of hairs on their heads. 


The answer to the second puzzle is no. It can't 
be proven that there must be at least two people in 
China with the same arrangement of teeth. Humans 
have as many as 32 teeth. You can represent any 
teeth arrangement (missing/existing) with a 32-bit 
bitmap. The number of possible combinations is 
23? —more than 4 billion. Because there are more 
possible combinations of teeth arrangements than 
the number of people in China, it's possible that all 
Chinese have unique teeth arrangements. 


OCTOBER'S PUZZLE: 

2 MATHEMATICIANS 

Two mathematicians (let's call them M and N)—once 
good friends—meet after a long time to have a drink 
together. M asks, "Are you married? Any kids? Do you 
still live in that old apartment building?" N replies, 
“Yes, l'm married with three kids, and we live in a 
private house now." M asks, "How old are your kids?" 
N replies, "Let me answer with a riddle: The product 
of the ages of my kids is 36. Now, see that bus over 
there? The sum of my kids' ages is equal to that bus 
number." M thinks for a moment, then says, "I don't 
have sufficient information to solve the puzzle." N 
replies, "Oh, yes, you're right, | forgot to mention 
that one of my kids was born before we bought the 
house." Soon after N provides this last bit of informa- 
tion, M solves the puzzle and tells N the correct ages 
of the kids. Can you figure out the solution? Also, 
how would the solution change if N’s additional piece 
of information was that one of his kids was born after 
he bought the house? 


InstantDoc ID 96804 
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Making the MOST 
ot AUTOMATIC 


STATISTICS 


UPDATING 


Tune performance by using UPDATE STATISTICS 
and trace flags to improve index statistics 


you've ever researched performance tuning, 
if you're well aware that correct indexing is 

probably the most important area on which 
to spend your tuning efforts. Good performance 
requires good indexes—but for SQL Server's opti- 
mizer to recognize the usefulness of your indexes, 
you need high-quality statistics. Starting in SQL 
Server 7.0, SQL Server updated index statistics 
automatically, which improved the chances of your 
statistics being up to date. Prior to version 7.0, 
statistics could only be updated manually; thus, the 
first task on most performance-troubleshooting 
checklists was to update the statistics on all tables 
needed by the troublesome queries. 

Automatic updating of your index statistics Is a 
great feature, but it isn’t perfect. We’ll look at how SQL 
Server's optimizer determines when statistics should 
be updated, then learn about some new trace flags in 
SQL Server 2005 SP1 that give you more control over 
the automatic-statistics-updating functionality. 


Keeping Statistics Updating On 

By default, all SQL Server databases are created 
with the AUTO_UPDATE_STATISTICS database 
option enabled. For most tables, the work required 
to update statistics won't affect your system's perfor- 
mance enough for you to consider turning off this 
option. Of course, there are exceptions, but gener- 
ally you should assume that this option should stay 
on unless you're certain that your applications run 
better with it off. In most cases, the performance 
degradation you'll experience if statistics. aren't 
regularly updated will be much worse than perfor- 
mance problems you might encounter by leaving 
this option on. 

If you believe automatic statistics updating 
impedes your system performance, you have a 
couple of alternatives for capturing statistics. One 
is to turn off automatic updates for a single table 
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or index, then use ALTER INDEX and set the 
STATISTICS NORECOMPUTE option to ON. 
Another choice is to use a SOL Server 2005 feature 
called AUTO UPDATE STATISTICS ASYNC, 
which can be set with ALTER DATABASE. If this 
option is on, SQL Server will automatically update 
statistics in the background, and the query that 
detected that statistics needed updating won't wait 
for the statistics updating to occur. Individual que- 
ries won't be affected by the updating of statistics, 
and the system as a whole will benefit by having 
the statistics regularly updated. You can read more 
about ALTER INDEX and ALTER DATABASE 
in SQL Server Books Online (BOL). 

Although having automatic statistics updating 
1s rarely detrimental, one problem with this feature 
Is that 1t's activated only after a substantial number 
of changes have occurred in the index key values. 
Although very small tables might have their index 
statistics updated more aggressively, any table 
with more than 500 rows needs to have at 
least 20 percent of its rows modified, or the 
number of rows needs to increase by at least 
20 percent in order for SQL Server to detect 
that statistics are out of date. This means that for a 
100,000-row table, you have to update or insert at 
least 20,000 rows. 

If the updates or inserts are evenly distributed 
throughout the table, statistics updating might not 
be necessary. For example, if you modify 10 percent 
of the rows in the table, and the updates are evenly 
spread throughout the table, the original statistics 
could still continue to provide satisfactory estimates 
of the number of rows that would satisfy a query. 


Updating Statistics After a Range 
of Values Changes 

However, if the data changes so that there are many 
new values greater than the previous maximum data 
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value, it might be impossible to obtain good plans until 
the statistics are updated. Let’s look at an example. 

Run the code in Listing 1 to build a table called 
dbo.details in the AdventureWorks database, and 
copy the rows from Sales.SalesOrderDetail into the 
table. Listing 1 then builds a clustered and nonclus- 
tered index on the table. 

The dbo.details table has 121,317 rows and 1,258 
pages. You should be aware that the nonclustered 
index on SalesOrderID will be used when only a few 
rows from the dbo.details table are accessed. For this 
table, the cutoff is about 350 rows. A clustered index 
scan, which is the same as a table scan, is used for 
more than about 350 rows. Look at the execution 
plans for the following two queries to verify this. 


SELECT * FROM details 
WHERE SalesOrderID » 75000 


SELECT * FROM details 


WHERE SalesOrderID « 56000 


The first query returns 301 rows and uses a non- 
clustered index seek and a lookup into the clustered 


LISTING 1: Creating a Table and Building 2 Indexes on It 


USE AdventureWorks 

GO 

IF OBJECT ID('dbo.details', 
DROP TABLE dbo.details 


'U') IS NOT NULL 


GO 
CREATE TABLE dbo.details ( 
ESalesOrderID] Гіп%1 NOT NULL, 
ESalesOrderDetailID] Cint] NOT NULL, 
CCarrierTrackingNumber] Гпуагсһаг1(25) NULL, 
COrderQty] Csmallint] NOT NULL, 
EProductID] Гіп%1 NOT NULL, 
LSpecialOfferID] Гіп%1 NOT NULL, 
CUnitPrice] Cmoney] NOT NULL, 
CUnitPriceDiscount] Cmoney] NOT NULL CONSTRAINT 
CDF_SalesOrderDetail_UnitPriceDiscount1 
DEFAULT ((0.0)), 
CLineTotal] AS Cisnull((CUnitPrice]*((1.0)- 
CUnitPriceDiscount1))*COrder@ty1,(0.0))), 
Crowguid] Cuniqueidentifier] ROWGUIDCOL NOT NULL 
CONSTRAINT 
CDF_SalesOrderDetail_rowguid] DEFAULT (newid()), 
CModifiedDate] Cdatetime] NOT NULL CONSTRAINT 
CDF_SalesOrderDetail_ModifiedDate] 
DEFAULT (getdate()), 
» 
GO 


-- Next, insert data into the new table 


INSERT INTO dbo.details 

(SalesOrderID, SalesOrderDetailID, CarrierTrackingNumber, 
OrderGty, ProductID, SpecialOfferID, UnitPrice, 
UnitPriceDiscount, rowguid, ModifiedDate) 

SELECT SalesOrderID, SalesOrderDetailID, 
CarrierTrackingNumber, OrderQty, ProductID, 
SpecialOfferID, UnitPrice, UnitPriceDiscount, 
rowguid, ModifiedDate 

FROM Sales.SalesOrderDetail 

GO 


-- Now create a clustered and nonclustered index on the new 
-- details table 


CREATE CLUSTERED INDEX 

idx cl, col1 ON dbo.details(SalesOrderDetailID); 
CREATE NONCLUSTERED INDEX 

idx nc, col2 ON dbo.details(SalesOrderID); 

GO 
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index. The second query returns 57,877 rows and 
uses a clustered index scan. 

The maximum value in the SalesOrderID column 
is 75,123. Let's update several thousand rows to be 
greater than that maximum. The following query 
updates 14,148 rows to be greater than the previous 
maximum. 


UPDATE dbo.details 

SET SalesOrderID - 76000 
WHERE SalesOrderID « 47000 
GO 


When you look at the plan and number of logical 
reads for the following SELECT statement, you'll see 
that SQL Server continues to perform a nonclustered 
index seek because the statistics don't reflect the large 
number of rows greater than the previous maximum. 


SET STATISTICS IO ON 
SELECT * FROM dbo.details 
WHERE SalesOrderID » 75000 
GO 


You should see 44,282 reads, which is far more than 
the number of pages in the table. But if you update 
the statistics, as the statement below does, the opti- 
mizer will choose a better plan that uses a clustered 
index scan. 


UPDATE STATISTICS dbo.details 
GO 

SELECT * FROM dbo.details 
WHERE SalesOrderID » 75000 
GO 


Trace Flags 

SQL Server 2005 SP1 introduced the T-SQL trace 
flag 2389, which detects when the statistics on a 
particular index are repeatedly ascending. After 
three consecutive statistics-updating operations 
have noted ascending values exceeding the previous 
upper boundary of key values, SQL Server inter- 
nally marks the index as an ascending index. You 
can use the undocumented trace flag 2388 to show 
you whether an index's leading column has been 
determined to be ascending. This trace flag changes 
the output of the DBCC SHOW STATISTICS 
command to give you a historical look at the most 
recent statistics-updating operations. In this case, the 
only column in the DBCC SHOW. STASTISTICS 
output you'll be interested in is the last one, called 
Leading column Type. As with all undocumented 
trace flags, keep in mind that the behavior of trace 
flag 2388 isn't guaranteed to continue to work as 
described in any future version or service pack, and 
no further information is available about what the 
values mean or how they're generated. 
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Run the code in Listing 1 again to recreate the 
dbo.details table and its indexes. Then run Listing 
2 to turn on both trace flags, and perform three 
UPDATE operations, three SELECT operations, 
and three statistics updates. (If you look at the 
plans for the SELECT statements, you should see 
that they're all using an index seek inappropriately 
because of bad statistics information.) Then run 
DBCC SHOW STATISTICS to examine the sta- 
tistics for the ASCENDING state. You should see 
the values that Table 1 shows in the last column of 
the DBCC SHOW STATISTICS output. 

Now that the index 1s branded as "ascending," 
when you run one more data update, as follows, the 
statistics should be updated automatically: 


UPDATE dbo.details 

SET SalesOrderID - 82000 
WHERE SalesOrderID « 56000 
GO 


If you run the following SELECT, you'll see that 
the optimizer chooses the better plan even though 
statistics aren't manually updated. The number of 
reads should be 1,258, which indicates a clustered 
index scan. 


SELECT * FROM dbo.details 
WHERE SalesOrderID » 81000 


Another trace flag, 2390, introduced in SQL 
Server 2005 SP1, enables the same behavior as trace 
flag 2389—but only in cases where the optimizer 
doesn't know whether the leading index column 15 
ascending. So if you use both 2389 and 2390, your 
statistics should be automatically updated much more 
often. You should never use trace flag 2390 alone, since 
doing so would mean that statistics would be updated 
only when the ascending nature was unknown and not 
when the column was known to be ascending. 

If you're interested in exploring how to use trace 
flags 2389 and 2390, see the Microsoft articles “FIX: 
You may notice a large increase in compile time when 
you enable trace flags 2389 and 2390 in SQL Server 
2005 Service Pack 1" (http://support.microsoft 
.com/?kbid=922063) and “FIX: SQL Server 2005 


may not perform histogram amendments when 


you use trace flags 2389 and 2390” (http://support 
microsoft.com/?kbid-929278), which discuss prob- 


lems with the flags and fixes for them. As with any 


Microsoft hotfix, you should apply the fix only if 
you can definitely establish that the bug is negatively 
affecting your applications. 


Manual Intervention Still 
Required 

SQL Server's ability to automatically update statis- 
tics 1s useful, but you can't assume that Just because 
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LISTING 2: Turning on Trace Flags 
2388 and 2389 


DBCC TRACEON(2388, 2389) 

GO 

UPDATE dbo.details 

SET SalesOrderID - 76000 

WHERE SalesOrderID « 47000 

GO 

SELECT * FROM dbo.details 

WHERE SalesOrderID » 75000 

GO 

UPDATE STATISTICS dbo.details 

GO 

UPDATE dbo.details 

SET SalesOrderID - 78000 

WHERE SalesOrderID « 50000 

GO 

SELECT * FROM dbo.details 

WHERE SalesOrderID » 77000 

GO 

UPDATE STATISTICS dbo.details 

GO 

UPDATE dbo.details 

SET SalesOrderID - 80000 

WHERE SalesOrderID « 52000 

GO 

SELECT * FROM dbo.details 

WHERE SalesOrderID » 79000 

GO 

UPDATE STATISTICS dbo.details 

GO 

DBCC SHOW STATISTICS('dbo.details',idx nc. 
col2) 

GO 


the feature 1s enabled, you'll never need to 
run the UPDATE STATISTICS command. 


і» 


TABLE 1: Statistics for 


Two new trace flags in SQL Server 2005 SP1 an Ascending Index 


can help the automatic statistics-updating 
functionality gather more information about 
your data distribution. Nonetheless, even 
with the trace flags, a DBA still needs to ІШІ 
monitor query performance, spot when que- | NULL 
ries are performing less than optimally due - ші 
to out-of-date statistics, and either manually 
update the statistics or set up a job schedule 
to update them more often than the auto update 
feature allows. SOU 
InstantDoc ID 96767 
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Ascending 


Leading column Type 


To learn more about optimizing queries, statistics, and internals: 
"Exploring the Auto Update Statistics Option," InstantDoc ID 42590 
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For information about trace flags: 
“Investigating Trace Flags," InstantDoc ID 26406 


"Trace Flag Changes in SQL Server 2005," InstantDoc ID 92774 
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Data Warehousing: 


Dimen 


on. 
СЛЕТ ес 


The "arms" of the star schema give meaning 
and context to the measures in the fact table 


fits into the dimensional model in “Data Ware- 

housing: Measuring the Facts," September 2007, 
InstantDoc ID 96336. Now let's address the fun- 
damental design of the dimension—the “arms” of 
the star schema. Dimensions are tables that contain 
textual explanations of the coded and key values 
that are present in the star schema's fact table. The 
topic of dimensional modeling encompasses a large 
body of work, so if I miss something that you think 
should have been included, it's probably for lack of 
space and not for lack of interest or importance. You 
can contact me directly or join the forum discussion 


(go to http:/Avww.sqimag.com/go/dbdesign). That 
way the topic will get on my editorial calendar. 


| covered the basics of the fact table and how it 


Defining Dimensions 
Dimensions are a common way of analyzing data. 
In the article above and in “Discover the Star 
Schema,” July 2007, InstantDoc ID 96112, I implied 
that the dimensional model is user-driven; now I'm 
going to say it loud and clear: The dimensional 
model 1 laid out for ease of use and to be logical and 
understandable to business users. Dimensions 


ORE on the WEB define the dimensional model and spell out 


Read the Web-exclusive sidebar 
at InstantDoc ID 96848. 


the who, what, where, when, why, and how of 
the situation. They add context and meaning 
to the measures contained in the associated 
fact table. 

In a dimension table, there is one row per 
product, one row per customer, one row per facility, 
one row per time unit, or one row per geographic 
region, and the granularity or level of detail of 
these rows depends on how finely detailed the busi- 
ness user wants the data. For instance, is it enough 
to know the average dollar value of each sale? Or 
do you need to know the average dollar value of a 
sale for each day of the year? Do you need to know 
how much of the sale value was for perishable items 


versus nonperishable items? Do you need to know 
purchases down to the line item level? 

It’s very important to understand what the busi- 
ness reporting needs are before you start designing a 
data warehouse. You simply cannot be a cowboy and 
start slinging data around with data warehousing 
and dimensional design—there’s too much at stake: 
Too much data is involved in a data warehouse, the 
project visibility is too high, and it’s too costly to 
undo careless mistakes. You must plan. You must 
know how granular the level of detail needs to be. 


Structuring Dimensions 

At this point, if you're thinking “cube,” think about 
the flat surfaces of the cube—those are the dimen- 
sions. Dimensions control query filtering—for 
example, “...where region like West” in T-SQL or 
“WHERE (Region.West)” in MDX—and supply 
almost all labeling in the output result sets that are 
turned into reports. Dimensions provide the “by” 
words for a report— "sales by month by product,” 
“inventory by category by region." When you think 
about it, dimensions are the natural way a business 
user would talk about business. 

Each dimension contains data from a single 
domain, such as the time, product, or geographic 
area domain. Dimensions act like decoders when 
you “flatten” the lookup table hierarchy, as Figure 
1 shows. If the three tables on the left were part 
of an entity relationship diagram (ERD) for an 
online transaction processing (OLTP) database, 
Product. Category and Product, Subcategory would 
be lookup tables in a hierarchy, with Product, Cat- 
egory higher in the hierarchy than Product, Subcat- 
egory. The purpose of a lookup table is twofold: to 
enforce domain integrity by limiting the list of codes 
that can be used in the modified table, and to help 
reduce data redundancy by adhering to a production 
level of third normal form. 
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DIMENSION BASICS 


In a dimensional model, you'd 
consider flattening the hierarchy of 


PRODUCT. CATEGORY 


ProdCatKey int 


«pk 


products, categories, and subcatego- 
ries, as you can see in the Product_ 
With Category table. Each row 


ProdCatCode 


nvarchar(20) «ak 


ProdCatName пуагсһаг(50) 


PRODUCT. SUBCATEGORY 


in the Product With, Category 


ProdSubCatKey int 


<pk> 


dimension contains all the category 
content that’s relevant for that 
row, in addition to all the product 
information. You don’t need the 
one-to-many relationships of an 
ERD because you shouldn’t have 
to worry about domain integrity. 
The data has already gone through 
the extraction, transformation, and 
loading (ETL) process and has 
(presumably) been scrubbed and 
validated before loading into the 
data warehouse. But what about 


<] ProdCatKey int 
ProdSubCatCode nvarchar(20) <ak> 
ProdSubCatName nvarchar(50) 


«fo 


PRODUCT WITH. CATEGORY 


ProductKey int 


«pk 


PRODUCT 


A ProductKey int <pk> 
< ProductCode nvarchar(20) <ak> 
ProdSubCatKey int <fk> 
ProductName nvarchar(50) 
ListPrice money 
StandardCost топеу 
nvarchar(50) 


identity 
not null 
null 
null 
null 
null 
null 
null 
null 
null 
null 


nvarchar(50) 

nvarchar(50) 

float 
WeightUnitMeasure nvarchar(50) 


ProductCode 


ProductName 
ListPrice 
StandardCost 


nvarchar(20) <ak> 


ProdCatCode int 
ProdCatName int 
ProdSubCatCode int 
ProdSubCatName int 


nvarchar(50) 
money 
money 
nvarchar(50) 
nvarchar(50) 
nvarchar(50) 
float 


WeightUnitMeasure nvarchar(50) 


redundancy? With a data ware- 

house, the goal is not to reduce redundancy—data 
warehousing involves a different mindset from the 
transactional, operational database. If you're going 
to be building multidimensional cubes, you're better 
off flattening the hierarchies than not. 

Flattening hierarchies also makes the join paths 
between the dimensions and their associated fact 
tables much simpler, resulting in better performance 
and faster query results (we hope). The one-to-many 
relationships between the dimension and its associated 
fact table, illustrated in Figure 2, page 34, aren’t meant 
to protect data integrity. These relationships are meant 
to associate keys in the fact table with the expanded 
definitions, which are found in the dimensions. 

In any database there are hierarchies. Geographic 
hierarchies occur where zip codes aggregate into 
counties, counties into states, states into countries. 
With temporal hierarchies, hours aggregate into 
days, days into weeks and months, months into 
quarters, and quarters into years. In spatial hierar- 
chies, rooms aggregate into buildings, and buildings 
into campuses or city blocks, and so on. 

Hierarchies such as these and the product 
category-subcategory hierarchy in Figures 1 and 
2 are used in reporting and summarizing results. 
Depending on what information users require, they 
might use the hierarchies differently from other 
users. One of the most common scenarios is the 
dichotomy between marketing and manufacturing: 
What manufacturing considers one product might 
be many products to marketing, as when hospital 
scrubs were co-opted by young adults as the latest 
fashion statement. When generating reports, manu- 
facturing might not want to use product subcatego- 
ries, whereas marketing would. 

The flattening example in Figure | is the type 
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of denormalization you'll often see in a dimen- 
sional model. We denormalize to avoid an excessive 
number of join operations. In future columns ГЇЇ 
examine dimensional denormalization more closely, 
but for now, just remember to denormalize in such 
a way that you lose nothing except the OLTP struc- 
tural complexity. 


Dimensional Granularity 

In my September article cited above, I explained 
that fact tables have “grain,” or levels of detail. It 
turns out that dimensions also have grain, and the 


BY NORMALIZING? 


Figure 1 


Flattening a hierarchy of 
dimensions decodes original 
values 


ARE YOU REALLY SAVING ANYTHING 


How much are you saving on disk space by normalizing a dimension hierarchy 
instead of flattening it? Consider a table containing 1 million 300-byte rows. 
Rounding up to accommodate record headers and null bitmaps, this table would 


take up about 300MB of space. 


From the original dimension, remove a 15-byte category and replace it with a 
2-byte category key: You've just saved yourself about 6.35MB of space (1 million 


times 13 bytes). 


But then you have to add space needed for the category table—let's say 1,000 
15-byte rows for a total of about 15KB. Subtract this from your savings for the 
dimension, and your space savings is now down to 6.3MB, out of the 300MB you 


started with. That's not bad, but that's not the full story. 


Every dimension is associated with at least one fact table, and the fact table in 
this scenario is probably sized in the tens of gigabytes. We have to factor in the fact 
table because the dimension and the category will be joined to it an uncountable 
number of times—it's all part of the same picture, really. Let's say, for the sake of 
argument, that the fact table is 25GB. Now you're looking at 6.3MB saved on a 
total of 25.3GB—at this point, it's not even worth doing the math. You just don't 
save significant space by normalizing hierarchies in a data warehouse. Nor should 
you normalize in order to protect data integrity—that's what the extraction, transfor- 
mation, and loading (ETL) process is for. You should be denormalizing and laying 
out records so that users can navigate and query with ease and confidence. 


InstantDoc ID 96847 
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PRODUCT. WITH. CATEGORY 


ProductKey 


int 


«pk 


ProductCode 
ProdCatCode 
ProdCatName 
ProdSubCatCode 


ProductName 


ListPrice 
StandardCost 


SALE 


ProdSubCatName 


nvarchar(20) «ak 


int 

int 

int 

int 
nvarchar(50) 
money 
money 
nvarchar(50) 
nvarchar(50) 
nvarchar(50) 
float 


S. TERRITORY 


RESELLER. SALES 


ResellerSalesKey numeric 
ProductKey int 
SalesTerritoryKey int 
SalesOrderNbr nvarchar(20) 
SalesOrderLineNbr smallint 
RevisionNbr tinyint 
OrderQuantity int 
UnitPrice money 
ExtendedAmount money 
UnitPriceDiscountPct float 
DiscountAmount money 
ProductStandardCost money 
TotalProductCost money 


<pk> 
<fkl> 
<fk2> 


SalesTerritoryKey int 


SalesAmount money 


<pk> 


nvarchar! 
nvarchar! 
nvarchar! 
nvarchar! 
nvarchar! 
nvarchar! 


50) 
50) 
50) 
50) 
50) 
50) 


TaxAmount money 

c FreightAmount money 
CarrierTrackingNumber nvarchar(50) 
CustomerPONumber ^ nvarchar(50) 


Figure 2 


The flattened dimensional 
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hierarchy 


granularity of a dimension depends on the reporting 
requirements—just like the fact table's granularity. 
Unlike any other database modeling scheme, dimen- 
sional modeling is truly customer-driven. 

How fine-grained do you want your dimensions 
to be? Obviously, the level of detail of the dimension 
table has to be consistent with the level of detail of 
the fact table that it modifies. If you have a fact table 
that tells you that customer А bought product B at 
store C in region F on date D and paid for the pur- 
chase using E, and that the sale was credited to sales 
associate G, then all those dimensions, A through 
G, had better be able to define and describe the A 
through G identifiers in the fact table. 

One school of thought holds that a data ware- 
house should contain only summary data and that all 
available detail data should be stored in the associated 
OLTP source databases. That approach might work 
in some cases. However, I believe that fact tables and 
dimensions are an archive of historical activity and 
should contain the finest level of detail that you'll ever 
need for a report. One of the largest data warehouses 
in the world belongs to Wal-Mart—1t’s measured in 
petabytes (thousands of terabytes) and holds one 
and a half years of data at the individual sale level. 
If you need to be able to analyze individual sales 
in your organization, you'll need the finest level of 
granularity. It's better to create a data warehouse that 
stores data with a fine level of detail and not need that 
detail, than to start with summarized data and then 
realize you need more detail. 


Understandability and 
Performance 

The twin properties of a data warehouse are under- 
standability and performance. By flattening hierar- 


chies and not losing any data in the process, you make 
the dimension more understandable to the people 
who use the data. What can you do to enhance perfor- 
mance? You guessed it—flatten the hierarchies! 

One of the most common user activities in a 
dimensional model is drilling down to get more 
details. Drilling down means adding more row 
headers to the result set. Drilling up simply takes 
away those row headers You could drill down 
through the product category into a specific product 
subcategory until you found all the products in that 
subcategory, and never leave the Product, With, Cat- 
egory dimension. 

The Sales Territory dimension in Figure 2 is 
an excellent example of a dimension suited for 
drilldown. You can start with the Americas (zone), 
move on to the United States, Western region, Colo- 
rado, Denver County, 80208, and do all that with a 
single table, meaning no joins (at least not until you 
want some sales facts that relate to zip code 80208). 
Sales_Territory is a richly defined geographic dimen- 
sion with a flattened hierarchy. 

Flattenmg hierarchies is the same as elimi- 
nating snowflaking in the star schema. Hierarchies 
expressed as separate tables (such as the three tables 
on the left side of Figure 1) create snowflakes, 
which complicate the user presentation and often 
intimidate users, causing them to shy away from 
your data warehouse. Snowflaking (normalizing the 
hierarchy) causes most data warehouse applications 
to run relatively slowly because of all the joins in the 
background, lowering the quality of the user expe- 
rience. The savings on disk space that you achieve 
with snowflaking isn't worth it. For more discussion 
about snowflaking and disk space, see the sidebar 
"Are You Really Saving Anything By Normal- 
izing?” page 33.) And finally, snowflaking defeats 
the purpose of bitmap indexing, which SQL Server 
Analysis Services uses internally to optimize query 
response. To learn more about bitmap indexing and 
when it's used, see the Web sidebar “What’s a Bitmap 
Index?” (http://sqlmag.com/Article/ArticleID/96848/ 
sql_server_96848.html). 


Dimensions Illuminate Facts 
When you analyze data in a data warehouse, you 
start from a dimension. Dimensions add meaning 
and context to the fact table in a dimensional 
model and make the data accessible in an intuitive 
way. Because there’s so much more to say about 
dimensions, ГІІ continue this discussion in a future 
article. Until then, you can go to the Forum link at 
the beginning of this article to communicate with 
me and other readers about the fascinating topic of 
dimensions and data warehousing. [SQL 
InstantDoc ID 96846- 
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Protect UDM with 
Dimension Data 
Security, PART 2 


Harness the power of MDX expressions 


I explained in “Protect ОЮМ with 
S Dynamic Dimension Security" (July 
2007, InstantDoc ID 95998), SQL 


Server 2005 Analysis Services (SSAS) offers a robust 
role-based security model for restricting access to 
Unified Dimensional Model (UDM) objects and 
data. UDM dimension data security protects dimen- 
sion members and the data associated with them. 
My previous article explained the fundamentals 
of dimension data security. In this article, I explain 
how to harness the power of MDX expressions to 
implement dynamic dimension security. I discuss 
two practical approaches for meeting advanced 


requirements for 
ORE on «һе WEB 


securing UDM 


Download the Web listings and data. First, I 
project files at InstantDoc ID present a solu- 
388. tion in which the 


security policies are stored in a factless fact table. 
Next, I explain how to leverage SSAS stored proce- 
dures to integrate UDM with an external security 
service. 

My sample Dimension Security project demon- 
strates how you can set up advanced dimension data 


security. To obtain this project, go to http://www 


.sqlmag.com, enter InstantDoc ID 96763, and click 
the Download the Code zip file. Before deploying the 
SSAS project, you need to create a new table called 
FactSecurityFilter in the AdventureWorksDW 
relational database and populate it with data by 
executing the FactlessFactTable.sql script found in 
the zip file. 


Understanding Dynamic 
Dimension Security 

As I explained in my previous article, you imple- 
ment dimension data security by defining allowed 
and denied sets. An allowed set denies access to all 
attribute members except a set of allowed members. 
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Conversely, a denied set lets you see all attribute 
members except a set of denied members. A user can 
access only the cube space defined by the attribute 
members the user 1s allowed to see. From an end- 
user perspective, unauthorized members and their 
data simply don't exist in the cube. 

Basic dimension data security limitations. Basic 
dimension data security lets you define allowed and 
denied sets by selecting attribute members at design 
time. However, most real-life security requirements 
would outgrow basic dimension data security. 

First, basic dimension security might require mul- 
tiple database roles. For example, a common security 
requirement is to let a manager see only the sales 
data of the employees reporting directly or indirectly 
to him or her. If basic dimension data security were 
the only option, you'd have no choice except to set 
up a database role for each manager. Granted, SSAS 
2005 significantly improves dimension data security, 
to ensure that UDM scales well with many roles and 
large dimensions. (For more information about these 
improvements, see the Learning Path: Improvements 
to dimension security in Analysis Services 2005.) How- 
ever, as the number of database roles increases, so will 
your maintenance effort. 

In addition, the members of allowed and denied 
sets might be unknown at design time. For example, 
suppose you need to obtain the authorization 
policies from an external security service. UDM sup- 
ports dynamic dimension security to address such 
advanced security requirements. 

Implementing dynamic dimension data expres- 
sions. The term dynamic emphasizes the fact that 
the server resolves the allowed and denied sets at 
runtime. You can use the Dimension Data Design- 
ers Advanced tab to implement expression-based 
allowed and denied sets. Use the MDX Username 
function to make the expression user-specific. This 
function returns the Windows identity of the user in 
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in the format DomainName\UserName. For example, 
if Stephen logs in to the adventure-works domain 
as ѕіерһеп0, Username would return adventure- 
works\stephen0. Because you can use expressions 
with dynamic dimension security, you might need 
only a single database role. 

The Employee role in 
my sample Dimension 
Security project demon- 
strates dynamic dimen- 
sion security. Its allowed 
set expression restricts 
each manager to see the 
sales data of his or her 
direct and indirect sub- 
ordinates based on the 
Employees parent-child 
dimension. 

The  StrToMember 


security in Analysis 


MDX function returns 

the Login ID member 
associated with the interactive user. Thus, if Stephen 
Jiang queries the cube, the StrToMember function 
returns the [Employee].[Login ID].&[adventure- 
works\stephen0] member. Next, the Exists function 
finds the corresponding member in the Employee 
Name hierarchy. Assuming again that Stephen is the 
interactive user, Exists returns [Employee].[Employee 
Name].&[272]. 

Dimension data security for the parent-child 
dimension must be defined on the parent key (1.е., 
the attribute that defines the parent-child hierarchy) 
rather than on the dimension key. I use the Link- 


[Employee Level 02 + tssployec Level OF Employee Level 04 


IE) Ken J. Sánchez. Grin S. Welcher gj Stephen 


Figure 1 


Report showing data for 
Stephen Jiang and his 
subordinates 
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ToMember func- 

tion to find the 

corresponding 
member in the 

[Employee] 

.[Employees] 

0,6 e parent key attri- 
bute. Since the Exists function returns a set, I use 
the Item(0) function to return the first member of 
the set (there should be only one since an employee 
has a single login ID). Also, note that the expres- 
sion doesn’t specifically request the employee's 
subordinates (e.g., by using the MDX Descendants 

function), because granting access to a member in a 

parent-child hierarchy automatically grants access 

to its descendants. 
Testing the Employee role. Follow these steps to 
test the Employee role: 

1. Open the Dimension Security project in Business 
Intelligence Development Studio (BIDS) or in 
Visual Studio 2005. 

2.In Solution Explorer, expand the Roles folder 
and double-click the Employee role to open it 


in the Roles Designer. Switch to the Dimension 
Data tab. 

.Expand the Dimension drop-down list and 
select the Employee (attribute security defined) 
dimension. 

4. Expand the Attribute drop-down list and select 
Employees (attribute security defined). The 
Allowed Member Set text box shows the Link- 
Member MDX expression. 

. To quickly test the Employee role for a given 
employee, replace Username in the allowed 
set expression that Web Listing 1 (http://www 
.sglmag.com, InstantDoc ID 96763) shows with 
the login ID of the desired employee (eg., 
adventure-works\stephen0). 

6. Deploy the Dimension Security SSAS project; 
right-click the Dimension Security project node 
in the Solution Explorer window and select 
Deploy. 


чә 


Cn 


If you want to test the MDX query in SQL Server 
Management Studio (SSMS) to see the allowed set 
for a given employee, you can follow these steps: 

1. Open SSMS and connect to the SSAS instance. 

2. Right-click the Dimension Security database and 
select New Query, MDX. 

3. Web Listing 2 provides an example MDX query 
that returns the subordinates of Stephen Jiang, 
whose Windows login is adventure-works\ste- 
phenO. Enter this query in the MDX query pane 
and execute it by clicking the Exclamation toolbar 
button or pressing Ctrl+E. 


Follow these steps 1f you want to test the User- 
name function in the role: 

1. Use SSMS to open the Employee table in the 
AdventureWorksDW relational database. 

2. Change thelogin ID of the desired employee, such 
as Stephen Jiang, to your Windows login ID. 

3. Connect to the SSAS server and expand the 
Dimension Security database and. Dimensions 
folder. Right-click the Employee dimension and 
select Process. 

4. Make sure that the Processing Options column 
Is set to Process Update. Click OK to process the 
Employee dimension. 

. Right-click. Ше Adventure Works cube in 
the Dimension Security database and select 
Browse. 

6. Click the Change User toolbar button and select 
the Employee role. Finally, create a report similar 
to the one that Figure 1 shows by dragging the 
Employees hierarchy from the Employee dimen- 
sion on columns and the Reseller Sales-Sales 
Amount measure from the Measures, Internet 
Sales display folder on data. 


Cn 
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As expected, the report shows data only for 
Stephen Jiang and the AdventureWorks employees 
who report to him. For example, Amy Alberts isn't 
shown on the report because she's at the same level 
as Stephen in the Employees hierarchy. You might 
be surprised to see that the report shows Stephen's 
managers (Brian Welcker and Ken Sanchez). This is 
because enabling a member in a parent-child hier- 
archy automatically grants access to the member's 
ancestors. Consider enabling Visual Total on the 
Advanced tab (Dimension Data tab in the Cube 
Browser) if you want the ancestors' totals to be 
contributed only by the allowed descendants. 

As you can see, MDX expressions and the User- 
name function give you a lot of flexibility to secure 
UDM data. Now let's discuss two more advanced 
scenarios for dynamic dimension security. 


Importing Security Policies into a 
Factless Fact Table 
Suppose that the AdventureWorks online transac- 
tion processing (OLTP) application has complex 
authorization rules that dictate which resellers 
an employee is authorized to see. Your task is to 
propagate the same rules to UDM. One option is to 
replicate the security infrastructure from the source 
application. However, doing so will result in duplica- 
tion of the management effort necessary to maintain 
the security policies in two places. Instead, consider 
importing only the authorized members into a fact 
table that acts as a security filter. This would require 
importing a dataset that contains the authorized 
resellers per employee. The necessary steps to use 
this approach include implementing the fact table, 
implementing the measure group, implementing the 
database role, and testing the database role. 
Implementing the fact table. ^ new fact table 
(FactSecurityFilter) 1s required to store the autho- 
rized resellers for each employee, as Figure 2 shows. 
Its schema is simple. The table has two columns that 
store the identifiers (primary keys) of employees and 
authorized resellers. For example, if Stephen Jiang 
has a primary key of 272 in the source database and 
he is authorized to see resellers Associated Bikes 
(primary key 7) and Finer Mart (primary key 38), 
the fact table will have the following rows. 


EmployeeKey ^ ResellerKey 
272 7 
272 38 


The FactSecurityFilter fact table doesn't have 
any numeric facts (hence the term “factless” fact 
table). Note also that FactSecurityFilter doesn't keep 
an historical record of the security policies (1.e., it 
doesn't join the DimDate table). The assumption is 
that the last security policy prevails. For example, if 
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Stephen is authorized to see a reseller in the current 
period, he will be granted access to that reseller even 
though he might not have been authorized in the 
previous period. 

The SQL script Fact- 
lessFactTable.sql lets 
you create and populate 
the FactSecurityFilter 
fact table. An extrac- 
tion, transformation, 
and loading (ETL) task 
could refresh the FactSe- 
curityFilter fact table on 
a regular basis (e.g., as 
part of the data import 
process). 


Implementing the 

measure group. Next, you need to map the Fact- 

SecurityFilter table to a UDM measure group as 

you would any other fact table, by following these 

general steps: 

1. In BIDS, add the fact table to the data source 
view (DSV). 

2. Create a new Security Filter measure group that 
uses FactSecurityFilter as a source, as Figure 3 
shows. 

3. A measure group must have at least one measure; 
therefore, leave the system-generated Security 
Filter Count measure. Don't set the Visible prop- 
erty of the measure to False in an attempt to hide 
the Security Filter measure group from the end 
user. If you do so, the MDX expression for the 
allowed set won't work (reported as a bug). 

. Select the Dimension Usage tab in the Cube 
Designer to verify the dimension relationships to 
the Security Filter measure group. 


A 


Figure 2 


The FactSecurityFilter fact 
table stores the authorized 
members 


At this point, the Cube 
Designer should have created 
two dimension relationships 
(Reseller and Employee) to the 


|9. Cube Structure |91 Dimension Usage | 


$a uui ux 


Security Filter measure group. | Measures 


(More detailed instructions for 
working with DSVs and mea- 
sure groups in BIDS can be 
found in the SOL Server Books 
Online—BOL—SQL Server 


(@ Adventure Works 
Œ [ui] Internet Sales 
@ m Reseller Sales 
E [ul] Security Filter 
ss] Security Filter Count 


2005 Analysis Services Tuto- 
rial, at http://msdn2.microsoft. 
com/en-us/library/ms170208.aspx.) 

Implementing the database role. With the Security 
Filter measure group in place, you're ready to set up 
a database role. 

1. Right-click the Roles folder in the Solution 

Explorer window in BIDS and select New 

Role. 


Figure 3 


Creating a Security Filter 
measure group that maps 
to the FactSecurityFilter 
fact table 
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Figure 4 


Using the Exists function to 
construct the allowed set 
for the authorized resellers 
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2. In the Solution Explorer window, 
rename the new role in place to 
Factless. 

3. In the Role Designer, switch to the 
Cubes tab and grant the Factless 
role read access to the Adventure 
Works cube. 

4. In the Dimension Data tab, select the 
Reseller cube dimension and the Reseller Name 
attribute. 

5. Enter the MDX expression for the allowed 
member set, as Figure 4 shows. This expression 
uses the Exists function to select the associated 
resellers for the interactive users over the Security 
Filter measure group. Note that you don’t have to 
use the attribute hierarchies that map to the keys 
in the fact table. For example, I use the Login ID 
attribute hierarchy in the expression instead of 
Employee Name. 


Figure 5 


Testing the database role. To quickly test the new 
role, replace the Username function with a member. 
For instance, use the following expression to test the 
database role for Stephen Jiang: 


Exists(LResellerl.LReseller Namel. 
MEMBERS, 

StrToMember("LEmployeel.LLogin 
ID1.&Ladventure-worksMstephen()1'), 
"Security Filter") 


Follow these steps to test the Factless role: 

1. Deploy the project. 

2. Right-click the Adventure Works cube and select 
Browse. 

3. Click the Change User toolbar button in the 
Cube Browser and select the Factless role. 

4. Create a report that has Reseller Name on rows and 

Reseller Sales-Sales Amount measure on data. 


Figure 5 shows the report results for Stephen 
Jiang. The report shows only 45 resellers (out of 702 
total), because Stephen Jiang is associated with these 
resellers only in FactSecurityFilter. In SSMS, you can 
use the following MDX query to verify the expression 
against the Dimension Security database: 


select LMeasuresl.LReseller Sales- 
Sales Amount] on 0, 


921: 1: ЖШ "07227 
© rou are browsing the cube using the credentials of the following roles: Factiess 


Perspective; Adventure Works м 


$46,616.67 
$37,289.18 
$61,696.56 


Resellers report results 


Exists(LResellerl.LReseller 
Namel.LReseller Namel1.MEMBERS, 
LEmployeel.LEmployee Name1.&L2721, 

"Security Filter") on 1 
from [Adventure Works] 


At this point, you might be concerned with the 
performance implications of the factless fact table 
approach. After all, thousands of authorized mem- 
bers could exist for each user, which might cause the 
fact table to increase considerably in size. In case 
you're wondering whether the SSAS storage engine 
can evaluate large allowed sets efficiently, I did a 
performance study for a real-life UDM with several 
million rows in the fact table; my study showed that 
the server was capable of constructing the allowed 
set in several seconds! 

Excellent performance is the most important 
advantage of the factless fact table approach. 
Because the server applies security when each user 
connects, minimizing the time to initialize dimen- 
sion data 15 essential. You might find other ways to 
further boost performance based on your require- 
ments. For example, if an employee has access to 
all resellers, don't import all members into the fact 
table. Instead, consider adding a new attribute to the 
Employee dimension that flags power users. Then, 
shortcut your allowed set expression to bypass the 
Exists function and return all members. 


Using SSAS Stored Procedures to 
Externalize Security 

What if the factless fact table approach isn't an 
option? For example, if you need to secure several 
dimensions, importing security policies into a fact 
table might be counterproductive. Or, you might have 
no other choice but to obtain the allowed set from an 
external security service at runtime. SSAS stored pro- 
cedures can help in these situations. Although SSAS 
stored procedures might not be as efficient as the fact- 
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less fact table approach, they give you more flexibility. 
To illustrate this benefit, let's create an SSAS stored 
procedure to return the authorized resellers. 

Implementing SSAS stored procedures. An SSAS 
stored procedure is nothing more than a method in 
a .NET assembly that's deployed to the SSAS server. 
Web Listing 3 shows the GetSecurityFilter stored 
procedure from the Extensibility VB.NET class 
library project. 

GetSecurityFilter returns a Microsoft.Analysis- 
Services.AdomdServer.Set object for the allowed 
set. For the sake of simplicity, GetSecurityFilter 
queries the AdventureWorksDW relational database 
to obtain the authorized resellers for the interactive 
user. Next, GetSecurityFilter enumerates through 
the dataset and constructs the allowed set. 

StrToSet vs. SetBuilder. Two options exist for 
creating an MDX set in an SSAS stored procedure. 
The first option should be familiar to readers expe- 
rienced with SSAS 2000. It uses the StrToSet func- 
tion to convert a comma-delimited string of tuples 
to an MDX set. The second option is new with 
SSAS 2005. It uses the Microsoft.AnalysisServices 
.AdomdServer.SetBuilder object to construct the 
set. SetBuilder far outperforms StrToSet. Table 1 
shows performance statistics gathered from a real- 
life project that compared both options. 

As the number of rows increases, SetBuilder’s 
performance advantage becomes clear. Note that 
40,000 records took 272 seconds to convert to a set 
with StrToSet and 144 seconds with SetBuilder (for 
a 47 percent performance increase). However, even 
with SetBuilder the SSAS stored procedure approach 
is much slower than using a factless fact table. 

Note that the server invokes the security stored 
procedure repeatedly when initializing a user ses- 
sion, once for each attribute hierarchy in the secured 
dimension. To avoid repeated calls to the external 
security service, cache the allowed and denied sets 
per user (eg., by using the .NET HttpRuntime 
.Cache object for in-memory caching. 

Setting up and testing the database role. To set up 
and test a database role that uses the GetSecurity- 
Filter stored procedure, follow these steps: 

1. Add a project reference to the Extensibility 
assembly in your SSAS project. To do so, right- 
click the Assemblies folder and select New 
Assembly Reference. 
In the Add Reference dialog, click the Projects 
tab and select the Extensibility project. Click the 
Add button to add the Extensibility project to 
the Selected Projects and Components pane, and 
click OK. 
.In Solution Explorer, select the Extensibility 
assembly and change its Permission Set property 
to External Access. 


n 
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4. Create a new database role called External. Grant 
the role read access to the Adventure Works cube. 
Use the following allowed MDX expression for 
the allowed set of the Reseller Name attribute: 


Extensibility.GetSecurityFilter 
(Username) 


5. Deploy the project and test the External role as 
you tested the Factless role. 


What if the stored procedure discovers that the 
user has rights to see all members? From a perfor- 
mance standpoint, letting the server bypass the set 
expression makes sense. Unfortunately, no method 
exists for shortcutting the MDX expression (e.g., 
by returning a null set from the stored procedure). 
Instead of returning a set with all members, con- 
sider assigning power users to a privileged database 
role. Or, if possible, add an attribute that flags these 
users and use the IIF function in the set expression 
to check the flag and bypass the stored procedure 
call. 

You can use the following MDX query in SSMS 
to test the GetSecurityFilter stored procedure: 


SELECT CMeasures].CReseller Sales- 
Sales Amount] ON Q, 
Extensibility.GetSecurityFilter 
("adventure-works\stephen@") ON 1 
FROM LAdventure Works] 


As with the factless fact table approach, the query 
should return only the resellers that Stephen Jiang 
is authorized to see. 


Secure UDM Data 

Dynamic dimension security lets you construct 
allowed and denied sets programmatically and 
apply user-specific security policies at runtime. The 
cornerstone of dynamic data security is the User- 


TABLE 1: Performance of StrToSet vs. SetBuilder 


Number of Records StrToSet (sec) SetBuilder (sec) 
5,000 52 35 
10,000 90 58 
20,000 171 108 
40,000 272 144 


name function, which returns the interactive user's 
Windows identity. If you need to obtain the security 
policies from an external system, use the factless 
fact table approach for maximum performance. 
Using an SSAS stored procedure lets you meet most 
demanding integration requirements. ЕП 
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Database Design Tools 


The proper tool simplifies 


and speeds this complex task 


Dow a database that’s flexible, fast, and 
efficient isn’t easy. In broad terms, it involves 
learning about and analyzing an organization's 
information needs, conceptualizing what the data- 
base will look like and what data it should store 
to meet those needs, transforming that conceptual 
representation to a logical design that includes rules 
and information about the structure and type of 
data, and adapting the logical design to a database 
management system (DBMS). 

SQL Server comes with SQL Diagrammer, a basic 
tool that some people use to design databases. But 
SQL Diagrammer is a bare-bones tool with some 
significant limitations. (For information about some 
of those limitations, see the Web-exclusive article 
“Describe and Design,” February 2006, InstantDoc 
ID 49185.) A third-party database design tool is a 
better choice. This buyer’s guide brings together nine 
such tools so you can compare them side by side. 


A Picture Worth 

a Thousand Words 

To build the efficient, high-performance database 

your organization requires, you need to use data mod- 

eling—the art of creating visual representations of 

a database’s data, relationships, and other elements. 

“No amount of indexing, clever programming, or 

beefed-up hardware can compensate for a poor 

design. Model first, get the database design right, and 
you're on your way to a database that performs well,” 
points out Michelle A. Poolet in the Web-exclusive 
article “Why Model?” (February 2006, InstantDoc 

ID 49184). She notes that a model, being a visual 

representation rather than Data Definition Language 

(DDL) code, also makes it easier to understand what 

kind of data your database contains. 

There are three types of data models: 

* A conceptual data model describes in nontech- 
nical terms what's important to an organization, 
what data the organization collects about those 
important elements, and the relationships between 
the elements. 

* A logical data model turns the information in the 
conceptual data model into a technical database 
design that illustrates how to build the database. 


* A physical data model specifies how to implement 
a logical data model in a specific DBMS, such as 
SQL Server. 


As you can see, the three models step you through 
the design process, so database design tools that sup- 
port all three models are preferable. Such tools pro- 
vide the methodology and instruments you need to 
define your organization’s data requirements, create 
a database design that meets those requirements, 
and implement that design in your DBMS. 

You build the conceptual, logical, and physical 
data models using a data-modeling notation—a 
standardized set of symbols and formats used to 
visually represent a database. Common notations 
are entity relationship (ER) modeling, enhanced ER 
modeling, Object Role Modeling (ORM), and Uni- 
fied Modeling Language (UML) modeling. Some 
notations (е.о., ER modeling) support all three 
models, whereas others (e.g., UML) support only 
the logical and physical data models. Because you 
follow the notation imposed by the database design 
tool you're using, if you're familiar with a particular 
notation you'll want to make sure that the tool you 
choose supports it. Keep in mind that the more 
notations a tool has, the more options you'll have 
when designing your database. 

Other nice-to-have visual aids include data-flow 
diagrams (DFDs) and data structure diagrams. 
DFDs illustrate the flow of data through a system 
and all the work or processing that's performed on 
that data as it moves through the system. Data struc- 
ture diagrams illustrate the relationships within an 
entity and the constraints between relationships. (If 
you're unfamiliar with the term entity, see the Web- 
exclusive sidebar “Basic Data Modeling Terms,” 
InstantDoc ID 96844, for a definition.) Data struc- 
ture diagrams can be provided as a standalone visual 
aid or as part of enhanced ER modeling. Enhanced 
ER modeling extends ЕК modeling concepts in 
various ways. For example, one extension incor- 
porates data structure diagrams, whereas another 
incorporates supertype and subtype information. 

After you determine which database design tools 
support the data models and notations you want, 
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you can start looking at those tools' features and 
capabilities. 


Automation Is Key 

Automation, perhaps the most important feature of 
a database design tool, lets you go from the concep- 
tual to the logical to the physical data model without 
having to reenter everything. Tools need automation 
capabilities to generate the code that will create the 
database—a process called forward engineering. (1 
discuss forwarding engineering later.) Some design 
tools also include capabilities such as automatic 
generation of indexes and triggers. 

Inheritance and validation are subsets of auto- 
mation. In inheritance hierarchies, a domain or 
column automatically acquires the metadata of the 
parent domain or column, which helps ensure valid 
and consistently formatted data. Inheritance also 
lets you reuse items, thereby saving time and hassle. 
Some database design tools support more advanced 
inheritance capabilities Шап others—user-defined 
inheritance and inheritance override, for example. 

Database design tools use validation to check for 
modeling errors and inconsistencies (e.g., duplicate 
names, missing items, incorrect notations) when you 
go from one data model to the next. Some tools 
provide more advanced validation capabilities, such 
as comparing and synchronizing domain properties 
between two data models and checking data in a data- 
base against business rules and defined constraints. 


Engineering and Updating 
Capabilities 

When designing a database, it helps to know what 
your existing database looks like. Reverse engi- 
neering lets you create a data model from a database 
so that you can document its structure in an effort 
to improve it or to avoid making the same mis- 
takes in the new database. You can also use reverse 
engineering to compare two databases—either dif- 
ferent databases or different versions of the same 
database—by creating a model for each and then 
comparing the models. Reverse engineering is a 
must-have feature, and every database design tool 
that I'm aware of includes it. 

Reverse engineering counterpart is forward 
engineering. Instead of creating a data model from 
a database, forward engineering creates a data- 
base from a physical data model. Database design 
tools that offer forward engineering automatically 
generate the scripts for the database by using the 
information in the physical data model. Because 
this functionality is a form of automation, most 
database designers consider forward engineering 
a must-have feature. Although almost all database 
design tools offer forward engineering, there can be 
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differences in the capabilities that are included. For 
example, some options might let you create only cer- 
tain parts of a model (e.g., certain tables) or might 
only generate a log showing the results. 

Besides generating scripts that create the data- 
base, many tools generate scripts for other purposes, 
such as loading the database with data and updating 
databases. Some tools let you generate database 
objects, stored-procedure code, and trigger code. 

Another must-have feature for most designers is 
the ability to update a database by making design 
changes to its data models. Changing a database's 
design through its data models provides an easy way 
to accommodate a company's evolving data needs. 
More important, it helps preserve the integrity of 
the database because you're not patching it—instead, 
you're updating it using the same methodologies and 
instruments you used to design it. You can also make 
sure the changes don't negatively affect performance. 


Nice Features to Have 
As I’ve said, DFDs and data structure diagrams are 
nice but not essential features to have in a database 
design tool. A few other features also fall into the 
nice-to-have category, such as the ability to undo 
and redo changes when building the model, 
and version control, which lets you save 
different versions of a model for archiving, 
comparison, and modification purposes. 
Advanced reporting capabilities are also 
nice-to-haves. Most database design tools let you 
produce reports in several formats (e.g., .html, .rtf) 
and export diagrams as image files. But some tools 
also offer capabilities such as customizable report 
templates and an add-on for creating Web sites that 
display data models. 


The Bottom Line 

Database design tools can include numerous fea- 
tures and capabilities—too many to discuss all 
the possibilities here. If you're in the market for a 
database design tool, it’s worthwhile to identify par- 
ticular special features you might want. For example, 
if you're a member of a database design team, you'll 
probably want a tool that has a central repository for 
storing designs. 

All these tools provide more capabilities than 
SQL Diagrammer does and go a long way toward 
making you more productive and helping you build 
the fast, flexible databases you need. Generally 
speaking, the more features and capabilities a tool 
has, the more it costs. If your budget allows you 
only one bell or whistle, opt for automation. Nine 
times out of 10, other features and capabilities will 
be included as part of the automation package. В 

InstantDoc ID 96845 


ORE on the WEB 


See the Web-exclusive table at 
InstantDoc ID 96845, and read 
the Web-exclusive sidebar at 


InstantDoc ID 96844. 
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SQL Server 
Services 
with 1 Click 


6 Farm Combine 1.8 (from SQL Farms) 
Q allows users to automate the execu- 
tion of scripts, queries, and updates across multiple 
databases and servers, all from a single interface. 
The latest release adds features designed to help 
enterprises manage large SQL Server environments 
and data centers. Among the improvements are en- 
hanced speed and performance, user-requested UI 
features, and expanded support for SQL services. 

Combine 1.8 helps admins manage SQL Serv- 
er services across the network by eliminating the 
need for tedious manual work such as logging on 
to individual machines and restarting services. 
Database managers can enable automated patches 
and then select the servers that they wish to restart 
from a single client interface, without requiring 
remote agents or installations. Administrators can 
also push configuration changes to all servers with 
one click from the same interface and can restart 
the appropriate services to load the configuration 
updates. According to Thomas Goff, founder and 
CTO at SQL Farms, “DBAs can now manage, 
push changes, configurations, and pull status or 
monitoring data from multple servers and easily 
manage even the largest server farms." 

Many of the new features in SQL Farm Com- 
bine 1.8 are designed for managing data at the mul- 
tiserver level, whereas earlier releases of the prod- 
uct have focused on managing individual servers. 
According to SOL Farms, new distributed man- 
agement and monitoring capabilities will be added 
to the software in future releases. 

SQL Farm Combine is $495 per SQL Server in- 
stance, which includes unlimited and installations 
and user accounts. For additional information 
email info@sqlfarms.com or visit the SQL Farms 
Web site at http://www.sqlfarms.com. 

—Diana May 
InstantDoc ID 96658 


Monitor Multiple Platforms 

on a Single Dashboard 
here are so many performance and network 
monitoring products on the market that it can 
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be difficult to differentiate one from another. How- 
ever, Heroix's Longitude 5.0 stands out not only be- 
cause it’s agentless, but also because it can monitor 
multiple platforms, such as Microsoft SQL Server, 
Oracle, and MySQL, on a single dashboard. Lon- 
gitude monitors the availability, performance, and 
operational integrity of your IT infrastructure. 

You can create service level agreements (SLAs) 
to monitor an individual network condition (e.g., 
ping request response time) or an aggregate of 
network conditions (e.g., network bandwidth and 
processor usage). Once an SLA has been created, 
Longitude's dashboard and reporting features can 
let you know when the factors being monitored by 
the SLA change—when network traffic slows, for 
example, or when processor usage exceeds your de- 
fined threshold. According to Ken Leoni, Heroix's 
vice president of product strategy, Longitude can 
also be used in virtualized environments to moni- 
tor the resources that guest OSs are using and how 
that usage is affecting the host OS. 

A real-time dashboard displays monitored data 
and makes troubleshooting easy, and the product's 
alerting capabilities notify you immediately if there's 
a performance or network problem. In addition, 
Longitude builds a repository of historical informa- 
tion that can be used for trending and predictive pur- 
poses. By default, Longitude keeps a year's worth of 
data in the repository, although that setting can be 
changed. Longitude can automatically generate per- 
formance reports and graphs, but can also produce 
custom reports for which users can specify which 
data 1s included and how that data is presented. 

This product 
is easy to install 
and can be quick- 
ly deployed. Lon- 
gitude 5.0 costs 
$299 per server to 
monitor the base 
OS and $599 to 
monitor both the 
base OS and lay- 
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ered applications. 
Longitude 5.0 starts at $299 per server to monitor 
the base OS and $599 to monitor both the base OS 
and layered applications; other components are 
extra. For additional information, visit the Heroix 
Web site at http://www.heroix.com. 500 
—Megan Веапу 
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DATABASE DEVELOPMENT 


Optimize Code Quality; Streamline Development 
т Quest Software has announced that the latest update to its Toad for SQL Server 
database development and administration utility features server-side filtering, 
SQL Server job management, and logging of executed SQL statements. A 
new difference viewer function lets you compare files and scripts and easily 
identify disparities and similarities. An updated permission editor streamlines 
managing permissions for databases, tables, functions, and stored procedures. 
For pricing and other information, contact Quest Software at 949-754-8000 or 


visit http://www.quest.com. 


DATABASE MANAGEMENT 
Manage Database Change e E QE OA UAM 


44 муч bam зе үре sp dd ito 


Change Manager 4.0 from Embarcadero Technologies includes 5 бону Schama Orme өзі бімен 
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three new utilities: CM/Data compares and validates data 
within or across database platforms, CM/Config analyzes 
database configuration attributes across database instances, 
and CM/Schema compares database schemas and generates 
synchronization scripts to aid in reconciling differences between 
them. The software supports multiple database types (e.g., SOL 
Server, Sybase, SOL Server, IBM DB2) on Linux, UNIX, and Windows platforms. Pricing begins at $795 per 
user. For more information, call 415-834-3131, or visit http://www.embarcadero.com for a free trial version. 


m 


DATA MANAGEMENT 


Manage Your Network Data 

Varonis has upgraded its data governance software, DatAdvantage 3.0, with improved scal- 
ability for collecting access event information, enhanced search functionality, and additional 
auditing and reporting options. New reporting options summarize attempts to access sensitive 
folders, inactive data sets, access-denied requests, and change details in Active Directory and 
on the network file system. Pricing starts at $25,000 for up to 250 users. For more information, 
contact Varonis at 877-292-8767 or go to http://www.varonis.com. 


DATABASE SECURITY 


Eliminate Data Vulnerabilities 

Keeping databases secure is the focus of DbProtect, a new product from Application Security. By combining 
database discovery, vulnerability scanning and assessment, activity monitoring, auditing, and optional encryp- 
tion, the DbProtect suite protects your databases from vulnerabilities and threats. Privileged-user monitoring 
prevents abuse by database and systems administrators, and a patch gap management feature helps prioritize 
security patches. DBProtect supports SQL Server, Oracle, IBM DB2, and Sybase ASE and costs $3,000 per 
database per year. For more information, call 866-927-7732 or go to http://www.appsecinc.com. 


PERFORMANCE 


Accelerate Network Performance 

SQL Nitro, a new utility from DBA24Hrs, organizes and compresses data going to and from the 

server by sending multiple Tabular Data Stream (TDS) packages per TCP packet and then com- 

pressing the larger TCP packets. SQL Nitro sits in the TCP stack to avoid drawing from server 

resources. SQL Nitro 1.2 is available for $495 directly from the vendor. For more information about 

SQL Nitro or to download a trial version, go to http://www.dba24hrs.com. ЕЙ 
InstantDoc ID 96788: 
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SoftTree Technologies 
SQL Assistant 2.5 


This toolkit provides for faster, more accurate 


SQL coding 


S Technologies SQL Assistant (SA) 2.5 
promises to enhance the productivity and effec- 
tiveness of database developers working with Micro- 
soft SQL Server 2005 and 2000; MySQL 5; IBM 
DB2; UDB 9, 8, and 7, or Oracle 10g, 91, and 87. SA 
runs in the background, and you access its produc- 
tivity features via hotkeys or a right-click menu from 
within your chosen development environment. Soft- 
Tree might have an update to SA available shortly 
after you read this, offering support for advanced 
dynamically generated code snippets; temporary 
table references and table variables; multiple code 
formatting styles; and more preconfigured SQL 
editors and IDEs. 

SA connects to the target database system to 
gain access to the object data—for example, table 
and view entries—that it needs. When possible, it 
will use an existing connection, sharing SQL Query 
Analyzer’s ODBC connection or SQL Server Man- 
agement Studio’s SMO connection for access to 
catalog tables and views. For access to user-defined 
objects, SA will open—and remember—a second 
connection by using credentials that you provide. 

To get a quick start in some familiar environ- 
ments, I installed SA to a system with a full instal- 
lation of SQL Server 2000 and another with SQL 
Server 2005. SA installed quickly, placing itself in 
the Startup group and running with a system tray 
icon. SA is preconfigured for use with 12 editing 
programs, including Notepad, Query Analyzer, 
SQL Server Enterprise Manager, and SQL Server 
Management Studio. 

SA's functions are accessible in two ways from 
within supported text editors: direct access to specific 
functions through the use of hot keys, and via inte- 
gration with the editing program’s right-click menus. 
To get a sense for SA’ native capabilities, I chose 
to start with Notepad. I started on the SQL Server 
2000 system by configuring Notepad for SQL Server 
2000—a simple process of selection from three drop- 
down boxes on the Targets tab in SA's Options menu. 
Opening a saved query in Notepad, I worked through 
SA's functions. A right-click in Notepad presented 
SQL Assistant as an option that displayed a submenu. 
Figure 1, page 46, shows an example from SQL Server 
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Management Studio. Selecting SQL Reference or 
Code Structure opens a pane along the left side of the 
window with Reference and Structure tabs. 

The Reference section lets you browse through SQL 
topics, showing a concise summary of statement 
structure and syntax, and allowing you to insert 
statement structures at the current cursor posi- 
tion in your code. With the cursor on a particular 
statement, Reference takes you directly to relevant 
information. When several statement formats are 
available—for example, Declare lets you declare a 
variable or a cursor—SA displays the alternatives. 
Clicking one of these alternatives inserts that state- 
ment structure at the cursor position. 

The Structure tab displays the overall structure of 
the statements in your code. Plus (+) and minus (-) 
characters let you expand and contract the display. 
Clicking a structure element takes you to that point 
in your code for quick navigation. 

In my tests, context-sensitive code completion was 
a real time saver. As I typed in a statement, potentially 
relevant object names from the server I was connected 
to popped up in list format. In my testing, this feature 
worked relatively quickly, exhibiting only a second 
or two delay as the code is being entered. I could 
use the mouse or keyboard to scroll through the list, 
then either left-click or press the Enter key to insert 


SQL ASSISTANT 2.5 


SoftTree Technologies, Inc 
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Nereus Computer Consulting. 


Pros: Auto completion of statement entry improves speed and accuracy for SQL 
coders; structural display allows for quick navigation through code; supports 


rapid insertion of named code snippets 


Cons: Easy to incorrectly insert context-sensitive Help into your code; room for 
some improvement in the Help and code-completion systems 


Rating: XX XX 


Price: $149 per seat, and volume discounts are available. 


Recommendation: SQL Assistant 2.5 is definitely worth checking out for both 
heavy coders and those who don't code frequently enough to maintain high skill 


levels. 


Contact: SoftTree Technologies, Inc. • www.softtreetech.com e 800-289-9256 
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SQL ASSISTANT 2.5 


the selected item. For example, when completing 
a SELECT statement, keying the space following 
the SELECT keyword displayed a list that included 
system tables, views, functions, and databases. Keying 
the first letter of the 
desired object name nar- 
rowed the list. Using the 
arrow keys or typing on 
the keyboard took me to 
the desired object, and 
pressing Enter inserted 
the name. When entering 
a fully qualified field 
name, keying the period 
(.) character displayed 
the names of the next 


Figure 1 


SQL Assistant within SOL 
Server Management 


level of objects, allowing 
the automatic statement completion to continue. At 
the completion of a field, keying the comma (,) char- 
acter displayed the original object list again to let me 
select another field name. 

Another useful component is the syntax checker, 
which you access via the right-click menu or (by 
default) the Control + F9 hotkey combination. Error 


messages appear in a pane at the bottom of the 
window. I noticed that error detection 1s sometimes 
progressive: Correcting one error will reveal others. 

Other functions performed by SA include code 
formatting and rapid insertion of named code snip- 
pets. A code reference to a stored procedure turns 
into a hotlink when you place the cursor on the 
name and press the Control key. Clicking the link 
displays the underlying procedure definition. 

SA can be a very useful tool, speeding up SQL 
coding and improving your accuracy. That said, I 
do see some room for improvement. For example, 
when building a Select statement the ability to select 
and insert multiple field names in a single opera- 
tion would be helpful. I would also like to be able 
to create a complete skeleton statement in a single 
operation in a way that would allow the context- 
sensitive auto-completion tool to help complete the 
various clauses of a complete statement. Neverthe- 
less, I believe that SA 1s well worth the price for those 
who spend their days coding and for those who have 
a regular need to construct SQL code—but perhaps 
not enough to maintain top skill levels. [SQL 
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Connecting SQL Server to the IBM System i 
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E data between SQL Server databases 
and other systems 1s a challenge. One of the 
most common systems that you might connect to 1s 
the IBM System i (formerly known as the iSeries or 
the AS/400). Widely used in ERP, manufacturing, 
and distribution systems, the System i is a relational 
database system that supports a variety of connec- 
tions to SQL Server. To connect, you need to install 
the System i data access middleware on your SQL 
Server system. Although there are several third- 
party drivers, the ODBC, OLE DB, and Microsoft 
‚МЕТ data-access components in IBM’s iSeries 
Access are high quality, and there’s no license fee 
for any company that has licensed IBM's 15/OS. 
After you make the connection, you can transfer 
data between SQL Server databases and the System 
i in several ways. 


BCP 

This venerable yet still fully functional data import 
and export tool can both import and export data 
from a SQL Server database. However, BCP is old 
school and to use it, you'll need to build in some 
type of file transfer process to transfer the BCP file 
to the System 1. Most companies use FTP, but you 
can also use IBM's NetServer. If you're using FTP, 
the FTP QUOTE RCMD subcommand can kick 
off jobs on the System 1 that will process the files 
after the FTP transfer is done. FTP is built in to both 
systems so this method doesn’t require iSeries Access 
middleware, but it is a multistep process. 


IT'S A STORY 
ABOUT HOUJ 
ENGINEERING 
MAKES YOU 
SEXTER. 


scottadams@aol.com 


www.dilbert.com 


Replication 

SQL, Server and the System i can also use snapshot 
replication to transfer data between the two systems. 
SQL Server needs to act as the Publisher, and the 
IBM iSeries Access ODBC provider needs to be 
installed on the SQL Server system. 


Linked Servers 

Linked servers can be great way to connect the 
System i to SQL Server if you need ad hoc interac- 
tive data access between the two systems, especially 
if you need to tie together applications running on 
SQL Server with data that resides on the System i. 
This method works best for smaller transactions 
such as looking up column values, but it can also 
be used for row-level updates. You can connect a 
SQL Server linked server to the System 1 by using 
the iSeries Access ODBC driver or the IBMDA400 
or IBMDASQL OLE DB providers. 


SQL Server Integration 
Services (SSIS) or DTS 
Great for bi-directional bulk transfers of data, SSIS 
and DTS packages can be scheduled to run peri- 
odically or on demand. SQL Server 2005 uses SSIS; 
SQL Server 2000 and SQL Server 7.0 use DTS, 
its precursor. Unlike BCP, DTS and SSIS provide 
direct database-to-database transfers. DTS connects 
using ODBC and OLE DB. SSIS can use these or 
better yet, the newer IBM .NET Data Provider for 
DB2. 5001) 
InstantDoc ID 96679 
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SOL safe v4.5 
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SOL diagnostic manager v5.0 
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Try Idera’s new SOL Server management solutions for yourself free for 14 days. Download everything 
you need for the SOL Server ultimate management solution at: 


